cDNAs encoding polypeptides

ABSTRACT

This invention relates to an isolated nucleic acid fragment encoding a phospholipase D. The invention also relates to the construction of a chimeric gene encoding all or a substantial portion of the phospholipase D, in sense or antisense orientation, wherein expression of the chimeric gene results in production of altered levels of the phospholipase D in a transformed host cell.

[0001] This application claims the benefit of U.S. ProvisionalApplication No. 60/143,410, filed Jul. 12, 1999; U.S. ProvisionalApplication No. 60/143,409, filed Jul. 12, 1999; U.S. ProvisionalApplication No. 60/153,534, filed Sep. 13, 1999; U.S. ProvisionalApplication No. 60/143,400, filed Jul. 12, 1999; U.S. ProvisionalApplication No. 60/161,223, filed Oct. 22, 1999; U.S. ProvisionalApplication No. 60/159,878, filed Oct. 15, 1999; and U.S. ProvisionalApplication No. 60/157,401, filed Oct. 01, 1999, all of which areincorporated herein by reference.

FIELD OF THE INVENTION

[0002] This invention is in the field of plant molecular biology. Morespecifically, it relates to nucleic acid sequences, the amino acidssequences encoded by such nucleic acids, and methods for modulatingtheir expression in plants.

BACKGROUND OF THE INVENTION

[0003] Reactive oxygen metabolites are produced as a response topathogen attack in most organisms including bacteria, mammals andplants. Superoxide and hydrogen peroxide are generated by anNADPH-dependent oxidase. In humans this plasma membrane oxidase isformed of two subunits gp91^(phox) and p22^(phox) which act togetherwith three cytosolic proteins p40^(phox), p47^(phox) and p67^(phox) toform an active complex. An Arabidopsis thaliana gene encoding arespiratory burst oxidase homolog A (RbohA) with similarity to the humangp91^(phox) but also containing an amino-terminal domain with twocalcium binding motifs has been described. The predicted amino acidsequence from this Arabidopsis thaliana gene contains binding sites andtransmembrane domains which are conserved with the rice RbohA (Keller,T. et al. (1998) Plant Cell 10:255-266). At least 6 differentArabidopsis thaliana homologs, named RbohA, RbohB, RbohC, RbohD, RbohE,and RbohF, have been identified for the human gp91^(phox) (Torres et al.(1998) Plant J. 14:365-370).

[0004] There are multiple, possibly redundant or synergistic pathways inresponse to a pathogen attack. Understanding the genes involved willallow the study of stress response and the engineering of plants withstress and disease resistance.

[0005] Transfer RNA from all organisms typically contains severalmodified nucleosides, in addition to the standard guanosine, adenosine,cytidine, and uridine. These modified bases are important for tRNAfolding and function. One group, 5-methylaminomethyl-2-thiouridylate, isfound in the “wobble position” of the tRNA anticodon sequence. Themodification is apparently important for the stabilization of tRNApairing to the codon. Mutations inhibiting the base modification lead toloss of translational fidelity (Hagervall and Bjork (1984) Mol. Gen.Genet. 196:194-200). The enzyme that performs this modification is tRNA(5-methylaminomethyl-2-thiouridylate)-methyltransferase, also calledtRNA-mnm⁵s²U-MT. Mutations in this enzyme can adversely affecttranslational regulation and can lead to lethality. Due to the lethalphenotype found in mutant genes, these are potential targets forherbicide treatment in plants, thus they will be useful for herbicidediscovery and design.

[0006] Cytosine methylation is the most common modification of DNA foundin nature. Cytosine methylation has been implicated in the control ofmany cellular processes including development, DNA repair, chromatinorganization, transcription, recombination and replication. Cytosine5-methyltransferase has been proposed to play a role in generalbiological processes such as cellular aging (Tollefsbol et al. (1993)Med. Hypotheses 41:83-92), carcinogenesis (Jones et al. (1990) Adv.Cancer Res. 54:1-23), human genetic diseases (Cooper et al. (1988) Hum.Genet. 78:151-155), and evolution (Sved et al. (1990) Proc. Natl. Acad.Sci. U.S.A. 87:4692-4696).

[0007] Another type of DNA methylation protein is chromomethylase. Eightdifferent chromometylases have been identified in Arabidopsis thaliana(Henikoffet al. (1998) Genetics 149:307-318). These proteins have commonchromodomains that are thought to mediate protein-protein interactionsbetween various chromatin molecules. Chromomethylase may also beinvolved in controlling many cellular processes.

[0008] There is a great deal of interest in identifying the genes thatencode proteins involved in DNA methylation in plants. These genes maybe used in plant cells to control the cell development, transcriptionand DNA replication. Accordingly, the availability of nucleic acidsequences encoding all or a substantial portion of a DNAmethyltransferase would facilitate studies to better understand DNAmethylation in plants and provide genetic tools to inhibit or otherwisealter DNA methyltransferase activity which in turn could providemechanisms to control cell development, transcription, DNA replicationand other cellular processes in plant cells.

[0009] Phospholipase D (PLD; EC 3.1.4.4) catalyzes the breakdown ofglycerophospholipids to produce choline and a phosphatidate. Originallyconsidered to exist only in plants, PLDs also have been found in mammalsand microorganisms. These enzymes have been proposed to play importantroles in transmembrane signaling, vesicle traffic, and responses tointernal and external stress. The first identified PLD (now calledPLD-alpha) does not need polyphosphoinositide as a cofactor and showshigher activity in the presence of millimolar calcium concentrations.Two other PLDs identified in Arabidopsis thaliana (PLD-beta andPLD-gamma) require polyphosphoinositide as a cofactor and requiremicrogram amounts of calcium for proper activity (Pappan et al. (1997)J. Biol. Chem. 272:7048-7054). These Arabidopsis thaliana PLDs have beenfurther characterized and shown to have different biochemicalproperties. PLD-alpha and PLD-gamma fractionate with the plasmamembrane, mitochondria, clathrin coated vesicles and intracellularmembranes from Arabidopsis thaliana leaves. PLD-gamma is also found inthe nuclear fraction while the amount of PLD-beta present makes itdifficult to detect in subcellular fractions.

[0010] Genes encoding PLD-alpha from corn and rice have been previouslyidentified (Ueki et al. (1995) Plant Cell. Physiol. 36:903-914). Genesencoding PLD-beta and PLD-gamma have only been identified in Arabidopsisthaliana. Identification of the genes encoding PLD-alpha in soybean andwheat and PLD-gamma in corn and soybean will enable the study ofmembrane signaling and stress response in agriculturally importantcrops. Lysophospholipids are incorporated within wheat starch granulesduring starch biosynthesis and phospholipase is implicated in theformation of lysophospholipid from phosphatidylcholine. Thus,manipulation of this biosynthetic pathway could enable the starch lipidcontent to be altered, generating starches with novel functionalproperties.

[0011] In eukaryotes transcription initiation requires the action ofseveral proteins acting in concert to initiate mRNA production. Twocis-acting regions of DNA have been identified that bind transcriptioninitiation proteins. The first binding site, located approximately 25-30bp upstream of the transcription initiation site, is termed the “TATAbox”. The second region of DNA required for transcription initiation isthe upstream activation site (UAS) or enhancer region. This region ofDNA is somewhat distal from the TATA box. During transcriptioninitiation, RNA polymerase II is directed to the TATA box by generaltranscription factors. Transcription activators, which have both a DNAbinding domain and an activation domain, bind to the UAS region andstimulate transcription initiation by physically interacting with thegeneral transcription factors and RNA polymerase. Direct physicalinteractions have been demonstrated between activators and generaltranscription factors in vitro (Triezenberg et al. (1988) Gene Dev.2:718-729; Stringer et al. (1990) Nature 345:783-786; Lin et al. (1991)Nature 353:569-571; Xiao et al. (1994) Mol. Cell. Biol. 14:7013-7024).One general transcription factor, TFIIF, has been shown to bind to RNApolymerase II and with the help of TFIIB, recruit RNA polymerase II tothe initiation complex. Transcription factor TFIIF is one of the largerinitiation factors, being composed of a tetramer consisting of two largealpha subunits and two small beta subunits (Gong et al. (1995) NucleicAcids Res. 23:1182-1186).

[0012] It is thought that adaptor proteins serve to mediate theinteraction between transcriptional activators and general transcriptionfactors. Functional and physical interactions have also beendemonstrated between the activators and various transcription adaptors.These transcription adaptors do not normally bind directly to DNA, butthey can “bridge” the interaction between transcription activators andgeneral transcription factors (Pugh and Tjian (1990) Cell 61:1187-1197;Kelleher et al. (1990) Cell 61:1209-1215; Berger et al. (1990) Cell61:1199-1208).

[0013] Accordingly, the availability of nucleic acid sequences encodingall or a substantial portion of TFIIF alpha and/or beta subunits willfacilitate studies to better understand transcription initiation inplants and ultimately will provide methods to engineer mechanisms tocontrol transcription.

[0014] Aminoacyl-tRNA synthetases ensure the fidelity of proteinbiosynthesis by aminoacetylating tRNAs. There are at least 20 differentaminoacyl-tRNA synthetases (one per amino acid). The firstasparaginyl-tRNA synthetase gene from a higher plant (plants other thanyeast) was identified in Arabidopsis thaliana chromosome IV (Aubourg etal. (1998) Biochim. Biophys. Acta 1398:225-231). A cDNA encoding Lupinusluteus Glutaminyl-tRNA synthetase has been characterized (NCBI GeneralIdentifier No. 3915866). Identification of aminoacyl-tRNA synthetases inother plants will be useful to develop herbicide-resistant plants andfor the discovery and design of new herbicides.

[0015] Plant defenses are activated by an interaction between the plantresistance (R) gene and the pathogen avirulence (avr) gene. The precisemode of interaction between R and avr has not been elucidated to date.The cDNAs encoding R genes from several monocot and dicot species havebeen identified. The mechanism of transduction of the R gene signal hasbeen studied using screens for mutations that affect disease resistanceor that affect specific defense responses and using the yeast two hybridsystem. These analyses have resulted in the idea that the R genetransduction pathways are highly branched (Innes (1998) Curr. Opin.Plant Biol. 1:229-304). Using a mutational approach, a recessivemutation called eds1 (enhanced disase susceptibility 1) was identifiedin Arabidopsis thaliana which abolishes the resistance to Peronosporaparasitica in the Wassilewskija (Ws-0) background (Parker et al. (1996)Plant Cell 8:2033-2046). The EDS1 protein was shown to be indispensablefor the function of the major class of R genes and contains a C-terminalregion with similarities to eukaryotic lipases (Falk, et al. (1999)Proc. Natl. Acad. Sci. USA 96:3292-3297). Identification of EDS1 inother plants such as the rice, soybean, and wheat disclosed herein willallow the study of the transduction mechanism.

[0016] Adaptins are components of the complexes which link clathrin toreceptors in coated vesicles. Clathrin-associated protein complexes arebelieved to interact with the cytoplasmic tails of membrane proteinsleading to their selection and concentration. The plasma membraneadaptor (AP2) is a heterologous tetrameric complex composed of two largechains (alpha adaptin and beta adaptin), a medium chain (AP50), and asmall chain (AP17). This adaptor complex is a component of the coatsurrounding the cytoplasmic face of the coated vesicles in the plasmamembrane. The cDNAs encoding two alpha adaptins have been isolated frommouse brain (Robinson (1989) J. Cell. Biol. 108:833-842) and a cDNAclone (Accession No. AF009631) encoding a protein homologous to the themicro-adaptins of clathrin-coated vesicle adaptor complexes has beenidentified in Arabidopsis thaliana. There are two beta adaptin subtypes,beta adaptin and beta′ adaptin. The beta′ adaptins from Homo sapienshave been studied and their loss of expression is thought to be involvedin meningioma production (Peyrard et al. (1994) Hum. Mol. Genet.3:1393-1399). Beta′ adaptin homologs have been identified in thesequencing projects for Drosophila melanogaster and Arabidopsisthaliana. The cDNAs encoding the 50 kDa subunit from AP2 (AP50) havebeen isolated from rat brain. Determination of the nucleotide sequenceallowed comparison with other known AP50s. This comparison showed thatAP50s are highly conserved although there are no significantsimilarities with other kinases or known proteins (Thurieau et al.(1988) DNA 7:663-669).

[0017] Identification of the sequences encoding the different adaptorsubunits from a variety of crops may be useful for engineeringendocytosis, and stimulating or increasing secretion in plants.

SUMMARY OF THE INVENTION

[0018] Generally, it is the object of the present invention to providepolynucleotides and polypeptides relating to phospholipases. It is anobject of the present invention to provide transgenic plants comprisingthe nucleic acids of the present invention, and methods for modulating,in a transgenic plant, expression of the polynucleotides of the presentinvention.

[0019] The present invention concerns are isolated nucleic acid encodinga polypeptide selected from the group consisting of SEQ ID NOs: 2, 4, 6,8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42,44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78,80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110,112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138,140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166,168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194,and 196 and the complement of such sequences.

[0020] The present invention concerns an isolated polynucleotidecomprising a nucleotide sequence selected from the group consisting of:(a) a first nucleotide sequence encoding a polypeptide of at least 80amino acids having at least 92% identity based on the Clustal method ofalignment when compared to a polypeptide selected from the groupconsisting of SEQ ID NOs:120, 122, 124, 126, 128, 130, 132, and 134, and(b) a second nucleotide sequence comprising the complement of the firstnucleotide sequence.

[0021] In a second embodiment, it is preferred that the isolatedpolynucleotide of the claimed invention comprises a nucleotide sequencewhich comprises a nucleic acid sequence selected from the groupconsisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131, and 133.

[0022] In a third embodiment, this invention concerns an isolatedpolynucleotide comprising a nucleotide sequence of at least one of 60(preferably at least one of 40, most preferably at least one of 30)contiguous nucleotides derived from a nucleotide sequence selected fromthe group consisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131,and 133 and the complement of such nucleotide sequences.

[0023] In a fourth embodiment, this invention relates to a chimeric genecomprising an isolated polynucleotide of the present invention operablylinked to at least one suitable regulatory sequence.

[0024] In a fifth embodiment, the present invention concerns a host cellcomprising a chimeric gene of the present invention or an isolatedpolynucleotide of the present invention. The host cell may beeukaryotic, such as a yeast or a plant cell, or prokaryotic, such as abacterial cell. The present invention also relates to a virus,preferably a baculovirus, comprising an isolated polynucleotide of thepresent invention or a chimeric gene of the present invention.

[0025] In a sixth embodiment, the invention also relates to a processfor producing a host cell comprising a chimeric gene of the presentinvention or an isolated polynucleotide of the present invention, theprocess comprising either transforming or transfecting a compatible hostcell with a chimeric gene or isolated polynucleotide of the presentinvention.

[0026] In a seventh embodiment, the invention concerns a phospholipase Dpolypeptide of at least 80 amino acids comprising at least 92% identitybased on the Clustal method of alignment compared to a polypeptideselected from the group consisting of SEQ ID NOs:120, 122, 124, 126,128, 130, 132, and 134.

[0027] In an eighth embodiment, the invention relates to a method ofselecting an isolated polynucleotide that affects the level ofexpression of a phospholipase D polypeptide or enzyme activity in a hostcell, preferably a plant cell, the method comprising the steps of: (a)constructing an isolated polynucleotide of the present invention or achimeric gene of the present invention; (b) introducing the isolatedpolynucleotide or the chimeric gene into a host cell; (c) measuring thelevel of the phospholipase D polypeptide or enzyme activity in the hostcell containing the isolated polynucleotide; and (d) comparing the levelof the phospholipase D polypeptide or enzyme activity in the host cellcontaining the isolated polynucleotide with the level of thephospholipase D polypeptide or enzyme activity in the host cell thatdoes not contain the isolated polynucleotide.

[0028] In a ninth embodiment, the invention concerns a method ofobtaining a nucleic acid fragment encoding a substantial portion of aphospholipase D polypeptide, preferably a plant phospholipase Dpolypeptide, comprising the steps of: synthesizing an oligonucleotideprimer comprising a nucleotide sequence of at least one of 60(preferably at least one of 40, most preferably at least one of 30)contiguous nucleotides derived from a nucleotide sequence selected fromthe group consisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131,and 133 and the complement of such nucleotide sequences; and amplifyinga nucleic acid fragment (preferably a cDNA inserted in a cloning vector)using the oligonucleotide primer. The amplified nucleic acid fragmentpreferably will encode a substantial portion of a phospholipase D aminoacid sequence.

[0029] In a tenth embodiment, this invention relates to a method ofobtaining a nucleic acid fragment encoding all or a substantial portionof the amino acid sequence encoding a phospholipase D polypeptidecomprising the steps of: probing a cDNA or genomic library with anisolated polynucleotide of the present invention; identifying a DNAclone that hybridizes with an isolated polynucleotide of the presentinvention; isolating the identified DNA clone; and sequencing the cDNAor genomic fragment that comprises the isolated DNA clone.

[0030] In an eleventh embodiment, this invention concerns a composition,such as a hybridization mixture, comprising an isolated polynucleotideor polypeptide of the present invention.

[0031] In a twelfth embodiment, this invention concerns a method forpositive selection of a transformed cell comprising: (a) transforming ahost cell with the chimeric gene of the present invention or a constructof the present invention; and (b) growing the transformed host cell,preferably a plant cell, such as a monocot or a dicot, under conditionswhich allow expression of the phospholipase D polynucleotide in anamount sufficient to complement a null mutant to provide a positiveselection means.

[0032] In a thirteenth embodiment, this invention relates to a method ofaltering the level of expression of a phospholipase D in a host cellcomprising: (a) transforming a host cell with a chimeric gene of thepresent invention; and (b) growing the transformed host cell underconditions that are suitable for expression of the chimeric gene whereinexpression of the chimeric gene results in production of altered levelsof the phospholipase D in the transformed host cell.

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS

[0033] The invention can be more fully understood from the followingdetailed description and the accompanying Sequence Listing which form apart of this application.

[0034] Table 1 lists the polypeptides that are described herein, thedesignation of the cDNA clones that comprise the nucleic acid fragmentsencoding polypeptides representing all or a substantial portion of thesepolypeptides, and the corresponding identifier (SEQ ID NO:) as used inthe attached Sequence Listing. The sequence descriptions and SequenceListing attached hereto comply with the rules governing nucleotideand/or amino acid sequence disclosures in patent applications as setforth in 37 C.F.R. §1.821-1.825.

[0035] Some of the polynucleotide and polypeptide sequences identifiedin Table 1 are found in previously filed U.S. Provisional Applicationsas indicated at the bottom of the table. TABLE 1 Plant Polypeptides SEQID NO: Protein Clone Designation (Nucleotide) (Amino Acid) Corn RbokA¹p0010.cbpco75rb 1 2 Rice RbohA¹ rlr6.pk0025.h9 3 4 Wheat RbohA¹wl1n.pk0005.c8 5 6 Corn RbohA p0010.cbpco75rb:fis 7 8 Rice RbohArlr6.pk0025.h9:fis 9 10 Wheat RbohA wl1n.pk0005.c8:fis 11 12 Corn RbohB¹p0010.cbpaa44rd 13 14 Rice RbohB¹ rls2.pk0022.d7 15 16 Soybean RbohB¹src2c.pk023.f15 17 18 Wheat RbohB¹ wl1n.pk0054.d8 19 20 Rice RbohBrls2.pk0022.d7:fis 21 22 Soybean RbohB src2c.pk023.f15:fis 23 24 WheatRbohB wl1n.pk0054.d8:fis 25 26 Rice RbohC² rlr6.pk0074.e9 27 28 RiceRbohC rlr6.pk0074.e9:fis 29 30 Corn RbohD² Contig of: 31 32cco1n.pk055.115 p0127.cntar92r Rice RbohD² rr1.pk0004.a2 33 34 SoybeanRbohD² sr1.pk0073.f1 35 36 Wheat RbohD² wlm96.pk044.g9 37 38 Rice RbohDrr1.pk0004.a2:fis 39 40 Soybean RbohD sr1.pk0073.f1:fis 41 42 WheatRbohD wlm96.pk044.g9:fis 43 44 Corn Respiratory Burst p0104.cabad88rb 4546 Oxidase Protein³ Rice Respiratory Burst rsl1n.pk013.i4 47 48 OxidaseProtein³ Soybean Respiratory Burst sdp2c.pk009.b13 49 50 OxidaseProtein³ Corn Respiratory Burst p0104.cabad88rb:fis 51 52 OxidaseProtein Rice Respiratory Burst rsl1n.pk013.i4:fis 53 54 Oxidase ProteinSoybean Respiratory Burst sdp2c.pk009.b13:fis 55 56 Oxidase Protein CornRbohE³ cen3n.pk0155.f12 57 58 Soybean RbohE³ se3.02c07 59 60 WheatRbohE³ wr1.pk178.b5 61 62 Corn RbohE cen3n.pk0155.f12:fis 63 64 WheatRbohE wrl.pk178.b5:fis 65 66 Corn RbohF³ p0010.cbpaa44rb 67 68 SoybeanRbohF³ sdp4c.pk014.k19 69 70 Corn RbohF p0010.cbpaa44rb:fis 71 72Soybean RbohF sdp4c.pk014.k19:fis 73 74 Corn tRNA-mnm⁵s²U-MT⁴cco1n.pk077.o18 75 76 Soybean tRNA-mnm⁵s²U-MT⁴ se5.pk0029.d2 77 78 CorntRNA-mnm⁵s²U-MT cco1n.pk077.o18:fis 79 80 Soybean tRNA-mnm⁵s²U-MTse5.pk0029.d2:fis 81 82 Jerusalem Artichoke hel1.pk0013.b1 83 84Chromomethylase⁵ Corn Chromomethylase⁵ p0094.cssth92ra 85 86 RiceChromomethylase⁵ rl0n.pk136.o14 87 88 Wheat Chromomethylase⁵wl1n.pk0095.f3 89 90 Wheat Chromomethylase⁵ w1m0.pk0028.h3 91 92Jerusalem Artichoke hel1.pkO0013.b1:fis 93 94 Chromomethylase CornChromomethylase p0094.cssth92ra:fis 95 96 Rice Chromomethylaserl0n.pk136.o14:fis 97 98 Wheat Chromomethylase srm.pk0035.c1:fis 99 100Corn Cytosine p0100.cbaaj24r 101 102 5-Methyltransferase⁵ Rice Cytosinerr1.pk0043.f8 103 104 5-Methyltransferase⁵ Soybean Cytosinesgs2c.pk004.h13 105 106 5-Methyltransferase⁵ Wheat Cytosinewr1.pk0076.a11 107 108 5-Methyltransferase⁵ Wheat Cytosinewre1n.pk0079.c6 109 110 5-Methyltransferase⁵ Rice Cytosinerr1.pk0043.f8:fis 111 112 5-Methyltransferase Soybean Cytosinesgs2c.pk004.h13:fis 113 114 5-Methyltransferase Wheat Cytosinewr1.pk0076.a11:fis 115 116 5 -Methyltransferase Wheat Cytosinewre1n.pk0079.c6:fis 117 118 5-Methyltransferase Soybean PLD α⁶sgs4c.pk004.c18 119 120 Wheat PLD α⁶ w1k4.pk0022.b7 121 122 Soybean PLDα sfl1.pk128.a18:fis 123 124 Wheat PLD α wlk4.pk0022.b7:fis 125 126 CornPLD γ⁶ p0083.cldaz07r 127 128 Soybean PLD γ⁶ src3c.pk012.d7 129 130 CornPLD γ p0083.cldaz07r:fis 131 132 Soybean PLD γ src3c.pk012.d7:fis 133134 Corn TF IIF α Subunit⁷ p0026.ccrbd22r 135 136 Corn TF IIF α Subunitp0026.ccrbd22r:fis 137 138 Corn TF IIF β Subunit⁷ p0014.ctusq39r 139 140Wheat TF IIF β Subunit⁷ wlm24.pk0018.g9 141 142 Corn TF IIF β SubunitContig of: 143 144 p0014.ctusq39r:fis p0107.cbcap19r Rice TF IIF βSubunit rcaln.pk007.p13:fis 145 146 Rice TF IIF β Subunitrl0n.pk0063.e10:fis 147 148 Rice TF IIF β Subunit rls6.pk0059.b8:fis 149150 Wheat TF IIF β Subunit wlm24.pk0018.g9:fis 151 152 CornAsparaginyl-tRNA p0119.cmtne90r:fis 153 154 Synthetase RiceAsparaginyl-tRNA rl0n.pk0039.b7:fis 155 156 Synthetase SoybeanAsparaginyl-tRNA src1c.pk001.a5:fis 157 158 Synthetase WheatAsparaginyl-tRNA wdr1.pk0005.f7:fis 159 160 synthetase WheatAsparaginyl-tRNA wr1.pk0067.h2 161 162 synthetase Corn Glutaminyl-tRNAp0129.clmad36r:fis 163 164 synthetase Rice Glutaminyl-tRNArds1c.pk007.e9:fis 165 166 synthetase Soybean Glutaminyl-tRNAsic1c.pk001.e18:fis 167 168 synthetase Wheat Glutaminyl-tRNAwlmk1.pk0001.g6:fis 169 170 synthetase Rice EDS1 rl0n.pk127.m10:fis 171172 Soybean EDS1 sls2c.pk037.c11:fis 173 174 Wheat EDS1wre1n.pk160.d1:fis 175 176 Corn AP50 p0127.cntam18r 177 178 Rice AP50rlr6.pk0083.e10:fis 179 180 Soybean AP50 sdp3c.pk006.d23:fis 181 182Wheat AP50 wdk1c.pk012.n13:fis 183 184 Corn Alpha Adaptinp0119.cmtoj48r:fis 185 186 Soybean Alpha Adaptin sl2.pk121.m20:fis 187188 Corn Beta′ Adaptin p0119.cmtnr87r:fis 189 190 Rice Beta′ Adaptinrds1c.pk005.c17:fis 191 192 Soybean Beta′ Adaptin sls2c.pk005.m4:fis 193194 Wheat Beta′ Adaptin wkm2c.pk0002.a3 195 196

[0036] The Sequence Listing contains the one letter code for nucleotidesequence characters and the three letter codes for amino acids asdefined in conformity with the IUPAC-IUBMB standards described inNucleic Acids Res. 13:3021-3030 (1985) and in the Biochemical J. 219(No. 2):345-373 (1984) which are herein incorporated by reference. Thesymbols and format used for nucleotide and amino acid sequence datacomply with the rules set forth in 37 C.F.R. §1.822.

DETAILED DESCRIPTION OF THE INVENTION

[0037] In the context of this disclosure, a number of terms shall beutilized. The terms “polynucleotide”, “polynucleotide sequence”,“nucleic acid sequence”, and “nucleic acid fragment”/“isolated nucleicacid fragment” are used interchangeably herein. These terms encompassnucleotide sequences and the like. A polynucleotide may be a polymer ofRNA or DNA that is single- or double-stranded, that optionally containssynthetic, non-natural or altered nucleotide bases. A polynucleotide inthe form of a polymer of DNA may be comprised of one or more segments ofcDNA, genomic DNA, synthetic DNA, or mixtures thereof. An isolatedpolynucleotide of the present invention may include at least one of 60contiguous nucleotides, preferably at least one of 40 contiguousnucleotides, most preferably one of at least 30 contiguous nucleotidesderived from SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61,63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125,127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153,155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181,183, 185, 187, 189, 191, 193, and 195, or the complement of suchsequences.

[0038] The term “isolated polynucleotide” refers to a polynucleotidethat is substantially free from other nucleic acid sequences, such asand not limited to other chromosomal and extrachromosomal DNA and RNAthat normally accompany or interact with it as found in its naturallyoccurring environment. Isolated polynucleotides may be purified from ahost cell in which they naturally occur. Conventional nucleic acidpurification methods known to skilled artisans may be used to obtainisolated polynucleotides. The term also embraces recombinantpolynucleotides and chemically synthesized polynucleotides.

[0039] The term “recombinant” means, for example, that a nucleic acidsequence is mace by an artificial combination of two otherwise separatedsegments of sequence, e.g., by chemical synthesis or by the manipulationof isolated nucleic acids by genetic engineering techniques.

[0040] As used herein, “contig” refers to a nucleotide sequence that isassembled from two or more constituent nucleotide sequences that sharecommon or overlapping regions of sequence homology. For example, thenucleotide sequences of two or more nucleic acid fragments can becompared and aligned in order to identify common or overlappingsequences. Where common or overlapping sequences exist between two ormore nucleic acid fragments, the sequences (and thus their correspondingnucleic acid fragments) can be assembled into a single contiguousnucleotide sequence.

[0041] As used herein, “substantially similar” refers to nucleic acidfragments wherein changes in one or more nucleotide bases results insubstitution of one or more amino acids, but do not affect thefunctional properties of the polypeptide encoded by the nucleotidesequence. “Substantially similar” also refers to nucleic acid fragmentswherein changes in one or more nucleotide bases does not affect theability of the nucleic acid fragment to mediate alteration of geneexpression by gene silencing through for example antisense orco-suppression technology. “Substantially similar” also refers tomodifications of the nucleic acid fragments of the instant inventionsuch as deletion or insertion of one or more nucleotides that do notsubstantially affect the functional properties of the resultingtranscript vis-à-vis the ability to mediate gene silencing or alterationof the functional properties of the resulting protein molecule. It istherefore understood that the invention encompasses more than thespecific exemplary nucleotide or amino acid sequences and includesfunctional equivalents thereof. The terms “substantially similar” and“corresponding substantially” are used interchangeably herein.

[0042] Substantially similar nucleic acid fragments may be selected byscreening nucleic acid fragments representing subfragments ormodifications of the nucleic acid fragments of the instant invention,wherein one or more nucleotides are substituted, deleted and/orinserted, for their ability to affect the level of the polypeptideencoded by the unmodified nucleic acid fragment in a plant or plantcell. For example, a substantially similar nucleic acid fragmentrepresenting at least one of 30 contiguous nucleotides derived from theinstant nucleic acid fragment can be constructed and introduced into aplant or plant cell. The level of the polypeptide encoded by theunmodified nucleic acid fragment present in a plant or plant cellexposed to the substantially similar nucleic fragment can then becompared to the level of the polypeptide in a plant or plant cell thatis not exposed to the substantially similar nucleic acid fragment.

[0043] For example, it is well known in the art that antisensesuppression and co-suppression of gene expression may be accomplishedusing nucleic acid fragments representing less than the entire codingregion of a gene, and by using nucleic acid fragments that do not share100% sequence identity with the gene to be suppressed. Moreover,alterations in a nucleic acid fragment which result in the production ofa chemically equivalent amino acid at a given site, but do not effectthe functional properties of the encoded polypeptide, are well known inthe art. Thus, a codon for the amino acid alanine, a hydrophobic aminoacid, may be substituted by a codon encoding another less hydrophobicresidue, such as glycine, or a more hydrophobic residue, such as valine,leucine, or isoleucine. Similarly, changes which result in substitutionof one negatively charged residue for another, such as aspartic acid forglutamic acid, or one positively charged residue for another, such aslysine for arginine, can also be expected to produce a functionallyequivalent product. Nucleotide changes which result in alteration of theN-terminal and C-terminal portions of the polypeptide molecule wouldalso not be expected to alter the activity of the polypeptide. Each ofthe proposed modifications is well within the routine skill in the art,as is determination of retention of biological activity of the encodedproducts. Consequently, an isolated polynucleotide comprising anucleotide sequence of at least one of 60 (preferably at least one of40, most preferably at least one of 30) contiguous nucleotides derivedfrom a nucleotide sequence selected from the group consisting of SEQ IDNOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35.37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71,73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105,107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133,135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161,163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189,191, 193, and 195 and the complement of such nucleotide sequences may beused in methods of selecting an isolated polynucleotide that affects theexpression of a respiratory burst oxidase homologs, methyltransferases,methylases, phospholipases, transcription factors, aminoacyl-tRNAsynthetases, AP-2 subunits, or EDS1 polypeptide in a host cell. A methodof selecting an isolated polynucleotide that affects the level ofexpression of a polypeptide in a virus or in a host cell (eukaryotic,such as plant or yeast, prokaryotic such as bacterial) may comprise thesteps of: constructing an isolated polynucleotide of the presentinvention or a chimeric gene of the present invention; introducing theisolated polynucleotide or the chimeric gene into a host cell; measuringthe level of a polypeptide or enzyme activity in the host cellcontaining the isolated polynucleotide; and comparing the level of apolypeptide or enzyme activity in the host cell containing the isolatedpolynucleotide with the level of a polypeptide or enzyme activity in ahost cell that does not contain the isolated polynucleotide.

[0044] Moreover, substantially similar nucleic acid fragments may alsobe characterized by their ability to hybridize. Estimates of suchhomology are provided by either DNA-DNA or DNA-RNA hybridization underconditions of stringency as is well understood by those skilled in theart (Hames and Higgins, Eds. (1985) Nucleic Acid Hybridisation, IRLPress, Oxford, U.K.). Stringency conditions man be adjusted to screenfor moderately similar fragments, such as homologous sequences fromdistantly related organisms, to highly similar fragments, such as genesthat duplicate functional enzymes from closely related organisms.Post-hybridization washes determine stringency conditions. One set ofpreferred conditions uses a series of washes starting with 6×SSC, 0.5%SDS at room temperature for 15 min, then repeated with 2×SSC, 0.5% SDSat 45° C. for 30 min, and then repeated twice with 0.2×SSC, 0.5% SDS at50° C. for 30 min. A more preferred set of stringent conditions useshigher temperatures in which the washes are identical to those aboveexcept for the temperature of the final two 30 min washes in 0.2×SSC,0.5% SDS which was increased to 60° C. Another preferred set of highlystringent conditions uses two final washes in 0.1×SSC, 0.1% SDS at 65°C.

[0045] Substantially similar nucleic acid fragments of the instantinvention may also be characterized by the percent identity of the aminoacid sequences that they encode to the amino acid sequences disclosedherein, as determined by algorithms commonly employed by those skilledin this art. Suitable nucleic acid fragments (isolated polynucleotidesof the present invention) encode polypeptides that are at least about70% identical, preferably at least about 80% identical to the amino acidsequences reported herein. Preferred nucleic acid fragments encode aminoacid sequences that are about 85% identical to the amino acid sequencesreported herein. More preferred nucleic acid fragments encode amino acidsequences that are at least about 90% identical to the amino acidsequences reported herein. Most preferred are nucleic acid fragmentsthat encode amino acid sequences that are at least about 95% identicalto the amino acid sequences reported herein. Suitable nucleic acidfragments not only have the above identities but typically encode apolypeptide having at least 50 amino acids, preferably at least 100amino acids, more preferably at least 150 amino acids, still morepreferably at least 200 amino acids, and most preferably at least 250amino acids. Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5.

[0046] Methods of alignment of sequences for comparison are well-knownin the art. Optimal alignment of sequences for comparison may beconducted by the local homology algorithm of Smith and Waterman, Adv.Appl. Math. 2: 482 (1981); by the homology alignment algorithm ofNeedleman and Wunsch, J. Mol. Biol 48: 443 (1970); by the search forsimilarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. 85: 2444(1988); by computerized implementations of these algorithms, including,but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics,Mountain View, Calif.; GAP, BESTFIT, BLAST, FASTA, and TFASTA in theWisconsin Genetics Software Package, Genetics Computer Group (GCG), 575Science Dr., Madison, Wis., USA; the CLUSTAL program is well describedby Higgins and Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90(1988); Huang, et al., Computer Applications in the Biosciences 8:155-65 (1992), and Pearson, et al., Methods in Molecular Biology 24:307-331 (1994).

[0047] The BLAST family of programs which can be used for databasesimilarity searches it includes: BLASTN for nucleotide query sequencesagainst nucleotide database sequences; BLASTX for nucleotide querysequences against protein database sequences; BLASTP for protein querysequences against protein database sequences; TBLASTN for protein querysequences against nucleotide database sequences; and TBLASTX fornucleotide query sequences against nucleotide database sequences. See,Current Protocols in Molecular Biology, Chapter 19, Ausubel, et al.,Eds., Greene Publishing and Wiley-Interscience, New York (1995);Altschul et al., J. Mol. Biol., 215:403-410 (1990); and, Altschul etal., Nucleic Acids Res. 25:3389-3402 (1997).

[0048] GAP (Global Alignment Program) can also be used to compare apolynucleotide or polypeptide of the present invention with a referencesequence. GAP uses the algorithm of Needleman and Wunsch (J. Mol. Biol.48:443-453, 1970) to find the alignment of two complete sequences thatmaximizes the number of matches and minimizes the number of gaps. GAPconsiders all possible alignments and gap positions and creates thealignment with the largest number of matched bases and the fewest gaps.The Wisconsin Genetics Software Package for protein sequences uses a gapcreation penalty value of 8 and a gap extension penalty value of 2. Forpolynucleotide sequences, the default gap creation penalty is 50 whilethe default gap extension penalty is 3. These penalties can be expressedas an integer selected from 0 to 100. Thus, for example, the gapcreation and gap extension penalties can each independently be:0, 1, 2,3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60 or greater. The scoringmatrix used in Version 10 of the Wisconsin Genetics Software Package isBLOSUM62 (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA89:10915).

[0049] A “substantial portion” of an amino acid or nucleotide sequencecomprises an amino acid or a nucleotide sequence that is sufficient toafford putative identification of the protein or gene that the aminoacid or nucleotide sequence comprises. Amino acid and nucleotidesequences can be evaluated either manually by one skilled in the art, orby using computer-based sequence comparison and identification toolsthat employ algorithms such as BLAST (Basic Local Alignment Search Tool;Altschul et al. (1993) J. Mol. Biol. 215:403-410; see alsowww.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of ten or morecontiguous amino acids or thirty or more contiguous nucleotides isnecessary in order to putatively identify a polypeptide or nucleic acidsequence as homologous to a known protein or gene. Moreover, withrespect to nucleotide sequences, gene-specific oligonucleotide probescomprising 30 or more contiguous nucleotides may be used insequence-dependent methods of gene identification (e.g., Southernhybridization) and isolation (e.g., in situ hybridization of bacterialcolonies or bacteriophage plaques). In addition, short oligonucleotidesof 12 or more nucleotides may be used as amplification primers in PCR inorder to obtain a particular nucleic acid fragment comprising theprimers. Accordingly, a “substantial portion” of a nucleotide sequencecomprises a nucleotide sequence that will afford specific identificationand/or isolation of a nucleic acid fragment comprising the sequence. Theinstant specification teaches amino acid and nucleotide sequencesencoding polypeptides that comprise one or more particular plantproteins. The skilled artisan, having the benefit of the sequences asreported herein, may now use all or a substantial portion of thedisclosed sequences for purposes known to those skilled in this art.Accordingly, the instant invention comprises the complete sequences asreported in the accompanying Sequence Listing, as well as substantialportions of those sequences as defined above.

[0050] “Codon degeneracy” refers to divergence in the genetic codepermitting variation of the nucleotide sequence without effecting theamino acid sequence of an encoded polypeptide. Accordingly, the instantinvention relates to any nucleic acid fragment comprising a nucleotidesequence that encodes all or a substantial portion of the amino acidsequences set forth herein. The skilled artisan is well aware of the“codon-bias” exhibited by a specific host cell in usage of nucleotidecodons to specify a given amino acid. Therefore, when synthesizing anucleic acid fragment for improved expression in a host cell, it isdesirable to design the nucleic acid fragment such that its frequency ofcodon usage approaches the frequency of preferred codon usage of thehost cell.

[0051] “Synthetic nucleic acid fragments” can be assembled fromoligonucleotide building blocks that are chemically synthesized usingprocedures known to those skilled in the art. These building blocks areligated and annealed to form larger nucleic acid fragments which maythen be enzymatically assembled to construct the entire desired nucleicacid fragment. “Chemically synthesized”, as related to a nucleic acidfragment, means that the component nucleotides were assembled in vitro.Manual chemical synthesis of nucleic acid fragments may be accomplishedusing well established procedures, or automated chemical synthesis canbe performed using one of a number of commercially available machines.Accordingly, the nucleic acid fragments can be tailored for optimal geneexpression based on optimization of the nucleotide sequence to reflectthe codon bias of the host cell. The skilled artisan appreciates thelikelihood of successful gene expression if codon usage is biasedtowards those codons favored by the host. Determination of preferredcodons can be based on a survey of genes derived from the host cellwhere sequence information is available.

[0052] “Gene” refers to a nucleic acid fragment that expresses aspecific protein, including regulatory sequences preceding (5′non-coding sequences) and following (3′ non-coding sequences) the codingsequence. “Native gene” refers to a gene as found in nature with its ownregulatory sequences. “Chimeric gene” refers to any gene that is not anative gene, comprising regulatory and coding sequences that are notfound together in nature. Accordingly, a chimeric gene may compriseregulatory sequences and coding sequences that are derived fromdifferent sources, or regulatory sequences and coding sequences derivedfrom the same source, but arranged in a manner different than that foundin nature. “Endogenous gene” refers to a native gene in its naturallocation in the genome of an organism. A “foreign gene” refers to a genenot normally found in the host organism, but that is introduced into thehost organism by gene transfer. Foreign genes can comprise native genesinserted into a non-native organism, or chimeric genes. A “transgene” isa gene that has been introduced into the genome by a transformationprocedure.

[0053] “Coding sequence” refers to a nucleotide sequence that codes fora specific amino acid sequence. “Regulatory sequences” refers tonucleotide sequences located upstream (5′ non-coding sequences), within,or downstream (3′ non-coding sequences) of a coding sequence, and whichinfluence the transcription, RNA processing or stability, or translationof the associated coding sequence. Regulatory sequences may includepromoters, translation leader sequences, introns, and polyadenylationrecognition sequences.

[0054] “Promoter” refers to a nucleotide sequence capable of controllingthe expression of a coding sequence or functional RNA. In general, acoding sequence is located 3′ to a promoter sequence. The promotersequence consists of proximal and more distal upstream elements, thelatter elements often referred to as enhancers. Accordingly, an“enhancer” is a nucleotide sequence which can stimulate promoteractivity and may be an innate element of the promoter or a heterologouselement inserted to enhance the level or tissue-specificity of apromoter. Promoters may be derived in their entirety from a native gene,or may be composed of different elements derived from differentpromoters found in nature, or may even comprise synthetic nucleotidesegments. It is understood by those skilled in the art that differentpromoters may direct the expression of a gene in different tissues orcell types, or at different stages of development, or in response todifferent environmental conditions. Promoters which cause a nucleic acidfragment to be expressed in most cell types at most times are commonlyreferred to as “constitutive promoters”. New promoters of various typesuseful in plant cells are constantly being discovered; numerous examplesmay be found in the compilation by Okamuro and Goldberg (1989)Biochemistry of Plants 15:1-82. It is further recognized that since inmost cases the exact boundaries of regulatory sequences have not beencompletely defined, nucleic acid fragments of different lengths may haveidentical promoter activity.

[0055] “Translation leader sequence” refers to a nucleotide sequencelocated between the promoter sequence of a gene and the coding sequence.The translation leader sequence is present in the fully processed mRNAupstream of the translation start sequence. The translation leadersequence may affect processing of the primary transcript to mRNA, mRNAstability or translation efficiency. Examples of translation leadersequences have been described (Turner and Foster (1995) Mol. Biotechnol.3:225-236).

[0056] “3′ Non-coding sequences” refers to nucleotide sequences locateddownstream of a coding sequence and includes polyadenylation recognitionsequences and other sequences encoding regulatory signals capable ofaffecting mRNA processing or gene expression. The polyadenylation signalis usually characterized by affecting the addition of polyadenylic acidtracts to the 3′ end of the mRNA precursor. The use of different 3′non-coding sequences is exemplified by Ingelbrecht et al. (1989) PlantCell 1:671-680.

[0057] “RNA transcript” refers to the product resulting from RNTApolymerase-catalyzed transcription of a DNA sequence. When the RNAtranscript is a perfect complementary copy of the DNA sequence, it isreferred to as the primary transcript or it may be a RNA sequencederived from posttranscriptional processing of the primary transcriptand is referred to as the mature RNA. “Messenger RNA (mRNA)” refers tothe RNA that is without introns and can be translated into polypeptidesby the cell. “cDNA” refers to DNA that is complementary to and derivedfrom an mRNA template. The cDNA can be single-stranded or converted todouble stranded form using, for example, the Klenow fragment of DNApolymerase I. “Sense RNA” refers to an RNA transcript that includes themRNA and can be translated into a polypeptide by the cell. “AntisenseRNA” refers to an RNA transcript that is complementary to all or part ofa target primary transcript or mRNA and that blocks the expression of atarget gene (see U.S. Pat. No. 5,107,065, incorporated herein byreference). The complementarity of an antisense RNA may be with any partof the specific nucleotide sequence, i.e., at the 5′ non-codingsequence, 3′ non-coding sequence, introns, or the coding sequence.“Functional RNA” refers to sense RNA, antisense RNA, ribozyme RNA, orother RNA that may not be translated but yet has an effect on cellularprocesses.

[0058] The term “operably linked” refers to the association of two ormore nucleic acid fragments so that the function of one is affected bythe other. For example, a promoter is operably linked with a codingsequence when it is capable of affecting the expression of that codingsequence (i.e., that the coding sequence is under the transcriptionalcontrol of the promoter). Coding sequences can be operably linked toregulatory sequences in sense or antisense orientation.

[0059] The term “expression”, as used herein, refers to thetranscription and stable accumulation of sense (mRNA) or antisense RNAderived from the nucleic acid fragment of the invention. “Expression”may also refer to translation of mRNA into a polypeptide. “Anti senseinhibition” refers to the production of antisense RNA transcriptscapable of suppressing the expression of the target protein.“Overexpression” refers to the production of a gene product intransgenic organisms that exceeds levels of production in normal ornon-transformed organisms. “Co-suppression” refers to the production ofsense RNA transcripts capable of suppressing the expression of identicalor substantially similar foreign or endogenous genes (U.S. Pat. No.5,231,020, incorporated herein by reference).

[0060] A “protein” or “polypeptide” is a chain of amino acids arrangedin a specific order determined by the coding sequence in apolynucleotide encoding the polypeptide. Each protein or polypeptide hasa unique function.

[0061] “Altered levels” or “altered expression” refer to the productionof gene product(s) in transgenic organisms in amounts or proportionsthat differ from that of normal or non-transformed organisms.

[0062] “Mature protein” or the term “mature” when used in describing aprotein refers to a post-translationally processed polypeptide; i.e.,one from which any pre- or propeptides present in the primarytranslation product have been removed. “Precursor protein” or the term“precursor” when used in describing a protein refers to the primaryproduct of translation of mRNA; i.e., with pre- and propeptides stillpresent. Pre- and propeptides may be but are not limited tointracellular localization signals.

[0063] A “chloroplast transit peptide” is an amino acid sequence whichis translated in conjunction with a protein and directs the protein tothe chloroplast or other plastid types present in the cell in which theprotein is made. “Chloroplast transit sequence” refers to a nucleotidesequence that encodes a chloroplast transit peptide. A “signal peptide”is an amino acid sequence which is translated in conjunction with aprotein and directs the protein to the secretory system (Chrispeels(1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53). If the proteinis to be directed to a vacuole, a vacuolar targeting signal (supra) canfurther be added, or if to the endoplasmic reticulum, an endoplasmicreticulum retention signal (supra) may be added. If the protein is to bedirected to the nucleus, any signal peptide present should be removedand instead a nuclear localization signal included (Raikhel (1992) PlantPhys. 100:1627-1632).

[0064] “Transformation” refers to the transfer of a nucleic acidfragment into the genome of a host organism, resulting in geneticallystable inheritance. Host organisms containing the transformed nucleicacid fragments are referred to as “transgenic” organisms. Examples ofmethods of plant transformation include 4Agrobacterium-mediatedtransformation (De Blaere et al. (1987) Meth. Enzymol. 143:277) andparticle-accelerated or “gene gun” transformation technology (Klein etal. (1987) Nature (London) 327:70-73; U.S. Pat. No.4,945,050,incorporated herein by reference). Thus, isolated polynucleotides of thepresent invention can be incorporated into recombinant constructs,typically DNA constructs, capable of introduction into and replicationin a host cell. Such a construct can be a vector that includes areplication system and sequences that are capable of transcription andtranslation of a polypeptide-encoding sequence in a given host cell. Anumber of vectors suitable for stable transfection of plant cells or forthe establishment of transgenic plants have been described in, e.g.,Pouwels et al., Cloning Vectors: A Laboratory Manual, 1985, supp. 1987;Weissbach and Weissbach, Methods for Plant Molecular Biology, AcademicPress, 1989; and Flevin et al., Plant Molecular Biology Manual, KluwerAcademic Publishers, 1990. Typically, plant expression vectors include,for example, one or more cloned plant genes under the transcriptionalcontrol of 5′ and 3′ regulatory sequences and a dominant selectablemarker. Such plant expression vectors also can contain a promoterregulatory region (e.g., a regulatory region controlling inducible orconstitutive, environmentally- or developmentally-regulated, or cell- ortissue-specific expression), a transcription initiation start site, aribosome binding site, an RNA processing signal, a transcriptiontermination site, and/or a polyadenylation signal.

[0065] Standard recombinant DNA and molecular cloning techniques usedherein are well known in the art and are described more fully inSambrook et al. Molecular Cloning: A Laboratory Manual; Cold SpringHarbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter“Maniatis”).

[0066] “PCR” or “polymerase chain reaction” is well known by thoseskilled in the art as a technique used for the amplification of specificDNA segments (U.S. Pat. Nos. 4,683,195 and 4,800,159).

[0067] The present invention concerns an isolated polynucleotidecomprising a nucleotide sequence selected from the group consisting of:(a) a first nucleotide sequence encoding a polypeptide of at least 80amino acids having at least 92% identity based on the Clustal method ofalignment when compared to a polypeptide selected from the groupconsisting of SEQ ID NOs:120, 122, 124, 126, 128, 130, 132, and 134, and(b) a second nucleotide sequence comprising the complement of the firstnucleotide sequence.

[0068] Preferably, the first nucleotide sequence comprises a nucleicacid sequence selected from the group consisting of SEQ ID NOs:119, 121,123, 125, 127, 129, 131, and 133, that codes for the polypeptideselected from the group consisting of SEQ ID NOs:120, 122, 124, 126,128, 130, 132, and 134.

[0069] Nucleic acid fragments encoding at least a substantial portion ofseveral plant polypeptides have been isolated and identified bycomparison of random plant cDNA sequences to public databases containingnucleotide and protein sequences using the BLAST algorithms well knownto those skilled in the art. The nucleic acid fragments of the instantinvention may be used to isolate cDNAs and genes encoding homologousproteins from the same or other plant species. Isolation of homologousgenes using sequence-dependent protocols is well known in the art.Examples of sequence-dependent protocols include, but are not limitedto, methods of nucleic acid hybridization, and methods of DNA and RNAamplification as exemplified by various uses of nucleic acidamplification technologies (e.g., polymerase chain reaction, ligasechain reaction).

[0070] For example, genes encoding other respiratory burst oxidasehomologs, methyltransferases, methylases, phospholipases, transcriptionfactors, aminoacyl-tRNA synthetases, AP-2 subunits, or EDS1, either ascDNAs or genomic DNAs, could be isolated directly by using all or asubstantial portion of the instant nucleic acid fragments as DNAhybridization probes to screen libraries from any desired plantemploying methodology well known to those skilled in the art. Specificoligonucleotide probes based upon the instant nucleic acid sequences canbe designed and synthesized by methods known in the art (Maniatis).Moreover, entire sequence(s) can be used directly to synthesize DNAprobes by methods known to the skilled artisan such as random primer DNAlabeling, nick translation, end-labeling techniques, or RNA probes usingavailable in vitro transcription systems. In addition, specific primerscan be designed and used to amplify a part or all of the instantsequences. The resulting amplification products can be labeled directlyduring amplification reactions or labeled after amplification reactions,and used as probes to isolate full length cDNA or genomic fragmentsunder conditions of appropriate stringency.

[0071] In addition, two short segments of the instant nucleic acidfragments may be used in polymerase chain reaction protocols to amplifylonger nucleic acid fragments encoding homologous genes from DNA or RNA.The polymerase chain reaction may also be performed on a library ofcloned nucleic acid fragments wherein the sequence of one primer isderived from the instant nucleic acid fragments, and the sequence of theother primer takes advantage of the presence of the polyadenylic acidtracts to the 3′ end of the mRNA precursor encoding plant genes.Alternatively, the second primer sequence may be based upon sequencesderived from the cloning vector. For example, the skilled artisan canfollow the RACE protocol (Frohman et al. (1988) Proc. Natl. Acad. Sci.USA 85:8998-9002) to generate cDNAs by using PCR to amplify copies ofthe region between a single point in the transcript and the 3′ or 5′end. Primers oriented in the 3′ and 5′ directions can be designed fromthe instant sequences. Using commercially available 3′ RACE or 5′ RACEsystems (BRL), specific 3′ or 5′ cDNA fragments can be isolated (Oharaet al. (1989) Proc. Natl. Acad Sci. USA 86:5673-5677; Loh et al. (1989)Science 243:217-220). Products generated by the 3′ and 5′ RACEprocedures can be combined to generate full-length cDNAs (Frohman andMartin (1989) Techniques 1:165). Consequently, a polynucleotidecomprising a nucleotide sequence of at least one of 60 (preferably oneof at least 40, most preferably one of at least 30) contiguousnucleotides derived from a nucleotide sequence selected from the groupconsisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61,63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97,99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125,127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153,155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181,183, 185, 187, 189, 191, 193, and 195 and the complement of suchnucleotide sequences may be used in such methods to obtain a nucleicacid fragment encoding a substantial portion of an amino acid sequenceof a polypeptide.

[0072] The present invention relates to a method of obtaining a nucleicacid fragment encoding a substantial portion of a respiratory burstoxidase homolog, methyltransferase, methylase, phospholipase,transcription factor, aminoacyl-tRNA synthetase, AP-2 subunit, or EDS1polypeptide, preferably a substantial portion of a plant respiratoryburst oxidase homolog, methyltransferase, methylase, phospholipase,transcription factor, aminoacyl-tRNA synthetase, AP-2 subunit, or EDS1polypeptide, comprising the steps of: synthesizing an oligonucleotideprimer comprising a nucleotide sequence of at least one of 60(preferably at least one of 40, most preferably at least one of 30)contiguous nucleotides derived from a nucleotide sequence selected fromthe group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19,21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55,57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91,93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121,123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149,151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177,179, 181, 183, 185, 187, 189, 191, 193, and 195, and the complement ofsuch nucleotide sequences; and amplifying a nucleic acid fragment(preferably a cDNA inserted in a cloning vector) using theoligonucleotide primer. The amplified nucleic acid fragment preferablywill encode a substantial portion of a respiratory burst oxidasehomolog, methyltransferase, methylase, phospholipase, transcriptionfactor, aminoacyl-tRNA synthetase, AP-2 subunit, or EDS1 polypeptide.

[0073] Availability of the instant nucleotide and deduced amino acidsequences facilitates immunological screening of cDNA expressionlibraries. Synthetic peptides representing substantial portions of theinstant amino acid sequences may be synthesized. These peptides can beused to immunize animals to produce polyclonal or monoclonal antibodieswith specificity for peptides or proteins comprising the amino acidsequences. These antibodies can be then be used to screen cDNAexpression libraries to isolate full-length cDNA clones of interest(Lerner (1984) Adv. Immunol. 36:1-34; Maniatis).

[0074] In another embodiment, this invention concerns viruses and hostcells comprising either the chimeric genes of the invention as describedherein or an isolated polynucleotide of the invention as describedherein. Examples of host cells which can be used to practice theinvention include, but are not limited to, yeast, bacteria, and plants.

[0075] As was noted above, the nucleic acid fragments of the instantinvention may be used to create transgenic plants in which the disclosedpolypeptides are present at higher or lower levels than normal or incell types or developmental stages in which they are not normally found.This would have the effect of altering the level of stress and diseaseresistance, enhancement of gene expression or transcription, qualitygrain improvement, or generation of novel starches in those cells.

[0076] Overexpression of the proteins of the instant invention may beaccomplished by first constructing a chimeric gene in which the codingregion is operably linked to a promoter capable of directing expressionof a gene in the desired tissues at the desired stage of development.The chimeric gene may comprise promoter sequences and translation leadersequences derived from the same genes. 3′ Non-coding sequences encodingtranscription termination signals may also be provided. The instantchimeric gene may also comprise one or more introns in order tofacilitate gene expression.

[0077] Plasmid vectors comprising the instant isolated polynucleotide(or chimeric gene) may be constructed. The choice of plasmid vector isdependent upon the method that will be used to transform host plants.The skilled artisan is well aware of the genetic elements that must bepresent on the plasmid vector in order to successfully transform, selectand propagate host cells containing the chimeric gene. The skilledartisan will also recognize that different independent transformationevents will result in different levels and patterns of expression (Joneset al. (1985) EMBO J. 4:2411-2418; De Almeida et al. (1989) Mol. Gen.Genetics 218:78-86), and thus that multiple events must be screened inorder to obtain lines displaying the desired expression level andpattern. Such screening may be accomplished by Southern analysis of DNA,Northern analysis of mRNA expression, Western analysis of proteinexpression, or phenotypic analysis.

[0078] For some applications it may be useful to direct the instantpolypeptides to different cellular compartments, or to facilitate theirsecretion from the cell. It is thus envisioned that the chimeric genedescribed above may be further supplemented by directing the codingsequence to encode the instant polypeptides with appropriateintracellular targeting sequences such as transit sequences (Keegstra(1989) Cell 56:247-253), signal sequences or sequences encodingendoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. PlantPhys. Plant Mol Biol. 42:21-53), or nuclear localization signals(Raikhel (1992) Plant Phys. 100:1627-1632) with or without removingtargeting sequences that are already present. While the references citedgive examples of each of these, the list is not exhaustive and moretargeting signals of use may be discovered in the future.

[0079] It may also be desirable to reduce or eliminate expression ofgenes encoding the instant polypeptides in plants for some applications.In order to accomplish this, a chimeric gene designed for co-suppressionof the instant polypeptide can be constructed by linking a gene or genefragment encoding that polypeptide to plant promoter sequences.Alternatively, a chimeric gene designed to express antisense RNA for allor part of the instant nucleic acid fragment can be constructed bylinking the gene or gene fragment in reverse orientation to plantpromoter sequences. Either the co-suppression or antisense chimericgenes could be introduced into plants via transformation whereinexpression of the corresponding endogenous genes are reduced oreliminated.

[0080] Molecular genetic solutions to the generation of plants withaltered gene expression have a decided advantage over more traditionalplant breeding approaches. Changes in plant phenotypes can be producedby specifically inhibiting expression of one or more genes by antisenseinhibition or co-suppression (U.S. Pat. Nos. 5,190,931, 5,107,065 and5,283,323). An antisense or co-suppression construct would act as adominant negative regulator of gene activity. While conventionalmutations can yield negative regulation of gene activity these effectsare most likely recessive. The dominant negative regulation availablewith a transgenic approach may be advantageous from a breedingperspective. In addition, the ability to restrict the expression of aspecific phenotype to the reproductive tissues of the plant by the useof tissue specific promoters may confer agronomic advantages relative toconventional mutations which may have an effect in all tissues in whicha mutant gene is ordinarily expressed.

[0081] The person skilled in the art will know that specialconsiderations are associated with the use of antisense or cosuppressiontechnologies in order to reduce expression of particular genes. Forexample, the proper level of expression of sense or antisense genes mayrequire the use of different chimeric genes utilizing differentregulatory elements known to the skilled artisan. Once transgenic plantsare obtained by one of the methods described above, it will be necessaryto screen individual transgenics for those that most effectively displaythe desired phenotype. Accordingly, the skilled artisan will developmethods for screening large numbers of transformants. The nature ofthese screens will generally be chosen on practical grounds. Forexample, one can screen by looking for changes in gene expression byusing antibodies specific for the protein encoded by the gene beingsuppressed, or one could establish assays that specifically measureenzyme activity. A preferred method will be one which allows largenumbers of samples to be processed rapidly, since it will be expectedthat a large number of transformants will be negative for the desiredphenotype.

[0082] In another embodiment, the present invention concerns apolypeptide of at least 80 amino acids having at least 92% identitybased on the Clustal method of alignment when compared to a polypeptideselected from the group consisting of SEQ ID NOs:120, 122, 124, 126,128, 130, 132 and 134.

[0083] The instant polypeptides (or substantial portions thereof) may beproduced in heterologous host cells, particularly in the cells ofmicrobial hosts, and can be used to prepare antibodies to these proteinsby methods well known to those skilled in the art. The antibodies areuseful for detecting the polypeptides of the instant invention in situin cells or in vitro in cell extracts. Preferred heterologous host cellsfor production of the instant polypeptides are microbial hosts.Microbial expression systems and expression vectors containingregulatory sequences that direct high level expression of foreignproteins are well known to those skilled in the art. Any of these couldbe used to construct a chimeric gene for production of the instantpolypeptides. This chimeric gene could then be introduced intoappropriate microorganisms via transformation to provide high levelexpression of the encoded polypeptide. An example of a vector for highlevel expression of the instant polypeptides in a bacterial host isprovided (Example 25).

[0084] Additionally, some of the instant polypeptides can be used as atarget to facilitate design and/or identification of inhibitors of thoseenzymes that may be useful as herbicides. This is desirable because thepolypeptides described herein catalyze various steps in RNA processing.Accordingly, inhibition of the activity of one or more of the enzymesdescribed herein could lead to inhibition of plant growth. Thus, theinstant polypeptides could be appropriate for new herbicide discoveryand design.

[0085] All or a substantial portion of the polynucleotides of theinstant invention may also be used as probes for genetically andphysically mapping the genes that they are a part of, and used asmarkers for traits linked to those genes. Such information may be usefulin plant breeding in order to develop lines with desired phenotypes. Forexample, the instant nucleic acid fragments may be used as restrictionfragment length polymorphism (RFLP) markers. Southern blots (Maniatis)of restriction-digested plant genomic DNA may be probed with the nucleicacid fragments of the instant invention. The resulting banding patternsmay then be subjected to genetic analyses using computer programs suchas MapMaker (Lander et al. (1987) Genomics 1:174-181) in order toconstruct a genetic map. In addition, the nucleic acid fragments of theinstant invention may be used to probe Southern blots containingrestriction endonuclease-treated genomic DNAs of a set of individualsrepresenting parent and progeny of a defined genetic cross. Segregationof the DNA polymorphisms is noted and used to calculate the position ofthe instant nucleic acid sequence in the genetic map previously obtainedusing this population (Botstein et al. (1980) Am. J. Hum. Genet.32:314-331).

[0086] The production and use of plant gene-derived probes for use ingenetic mapping is described in Bernatzky and Tanksley (1986) Plant Mol.Biol. Reporter 4:37-41. Numerous publications describe genetic mappingof specific cDNA clones using the methodology outlined above orvariations thereof. For example, F2 intercross populations, backcrosspopulations, randomly mated populations, near isogenic lines, and othersets of individuals may be used for mapping. Such methodologies are wellknown to those skilled in the art.

[0087] Nucleic acid probes derived from the instant nucleic acidsequences may also be used for physical mapping (i.e., placement ofsequences on physical maps; see Hoheisel et al. In: Nonmammalian GenomicAnalysis: A Practical Guide, Academic press 1996, pp. 319-346, andreferences cited therein).

[0088] In another embodiment, nucleic acid probes derived from theinstant nucleic acid sequences may be used in direct fluorescence insitu hybridization (FISH) mapping (Trask (1991) Trends Genet.7:149-154). Although current methods of FISH mapping favor use of largeclones (several to several hundred KB; see Laan et al. (1995) GenomeRes. 5:13-20), improvements in sensitivity may allow performance of FISHmapping using shorter probes.

[0089] A variety of nucleic acid amplification-based methods of geneticand physical mapping may be carried out using the instant nucleic acidsequences. Examples include allele-specific amplification (Kazazian(1989) J. Lab. Clin. Med. 11:95-96), polymorphism of PCR-amplifiedfragments (CAPS; Sheffield et al. (1993) Genomics 16:325-332),allele-specific ligation (Landegren et al. (1988) Science241:1077-1080), nucleotide extension reactions (Sokolov (1990) NucleicAcid Res. 18:3671), Radiation Hybrid Mapping (Walter et al. (1997) Nat.Genet. 7:22-28) and Happy Mapping (Dear and Cook (1989) Nucleic AcidRes. 17:6795-6807). For these methods, the sequence of a nucleic acidfragment is used to design and produce primer pairs for use in theamplification reaction or in primer extension reactions. The design ofsuch primers is well known to those skilled in the art. In methodsemploying PCR-based genetic mapping, it may be necessary to identify DNAsequence differences between the parents of the mapping cross in theregion corresponding to the instant nucleic acid sequence. This,however, is generally not necessary for mapping methods.

[0090] Loss of function mutant phenotypes may be identified for theinstant cDNA clones either by targeted gene disruption protocols or byidentifying specific mutants for these genes contained in a maizepopulation carrying mutations in all possible genes (Ballinger andBenzer (1989) Proc. Natl. Acad. Sci USA 86:9402-9406; Koes et al. (1995)Proc. Natl. Acad. Sci USA 92:8149-8153; Bensen et al. (1995) Plant Cell7:75-84). The latter approach may be accomplished in two ways. First,short segments of the instant nucleic acid fragments may be used inpolymerase chain reaction protocols in conjunction with a mutation tagsequence primer on DNAs prepared from a population of plants in whichMutator transposons or some other mutation-causing DNA element has beenintroduced (see Bensen, supra). The amplification of a specific DNAfragment with these primers indicates the insertion of the mutation tagelement in or near the plant gene encoding the instant polypeptides.Alternatively, the instant nucleic acid fragment may be used as ahybridization probe against PCR amplification products generated fromthe mutation population using the mutation tag sequence primer inconjunction with an arbitrary genomic site primer, such as that for arestriction enzyme site-anchored synthetic adaptor. With either method,a plant containing a mutation in the endogenous gene encoding theinstant polypeptides can be identified and obtained. This mutant plantcan then be used to determine or confirm the natural function of theinstant polypeptides disclosed herein.

[0091] The present invention provides machines, articles of manufacture,and processes for identifying, modeling, or analyzing thepolynucleotides and polypeptides of the present invention.Identification methods permit identification of homologues of thepolynucleotides or polypeptides of the present invention, while modelingand analysis methods permit recognition of structural or functionalfeatures of interest.

[0092] In one embodiment, the present invention provides a machinehaving: 1) a memory comprising data representing at least one geneticsequence, 2) a genetic identification, analysis, or modeling programwith access to the data, 3) a data processor which executes instructionsaccording to the program using the genetic sequence or a subsequencethereof, and 4) an output for storing or displaying the results of thedata processing.

[0093] The machine of the present invention is a data processing system,typically a digital computer. The term “computer” includes one orseveral desktop or portable computers, computer workstations, servers(including intranet or internet servers), mainframes, and any integratedsystem comprising any of the above irrespective of whether theprocessing, memory, input, or output of the computer is remote or local,as well as any network interconnecting the modules of the computer. Dataprocessing can thus be remote or distributed amongst several processorsat a single or multiple sites. The data processing system comprises adata processor, such as a central processing unit (CPU), which executesinstructions according to an application program. As used herein,machines, articles of manufacture, and processes are exclusive of themachines, manufactures, and processes employed by the United StatesPatent and Trademark Office or the European Patent Office forpatentability searches using data representing the sequence of apolypeptide or polynucleotide of the present invention.

[0094] The machine of the present invention further includes a memory,comprising data representing at least one genetic sequence. As usedherein, “genetic sequence” refers to the primary sequence (i.e., aminoacid or nucleotide sequence) of a polynucleotide or polypeptide of thepresent invention. The genetic sequence can represent a partial sequencefrom a full-length protein, genomic DNA, or full-length cDNA/mRNA.Nucleic acids or proteins comprising a genetic sequence that isidentified, analyzed, or modeled according to the present invention canbe cloned or synthesized.

[0095] As those of skill in the art will be aware, the form of memory ofa machine of the present invention, or the particular embodiment of thecomputer readable medium, are not critical elements of the invention andcan take a variety of forms. The memory of such a machine includes, butis not limited to, ROM, RAM, or computer readable media such as, but notlimited to, magnetic media such as computer disks or hard drives, ormedia such as CD-ROMs, DVDs, and the like. The memory comprising thedata representing the genetic sequence includes main memory, a register,and a cache. In some embodiments the data processing system stores thedata representing the genetic sequence in memory while processing thedata and wherein successive portions of the data are copied sequentiallyinto at least one register of the data processor for processing. Thus,the genetic sequence stored in memory can be a genetic sequence createdduring computer runtime or stored beforehand. The machine of the presentinvention includes a genetic identification, analysis, or modelingprogram (discussed below) with access to the data representing thegenetic sequence. The program can be implemented in software orhardware.

[0096] The present invention further contemplates that the machine ofthe present invention will reference, directly or indirectly, a utilityor function for the polynucleotide or polypeptide of the presentinvention. For example, the utility/function can be directly referencedas a data element in the machine and accessible by the program.Alternatively, the utility/function of the genetic can be indirectlyreferenced to an electronic or written record. The function or utilityof the genetic sequence can be a function or utility for the geneticsequence, or the data representing the sequence (i.e., the geneticsequence data). Exemplary function or utilities for the genetic sequenceinclude: 1) its name (per International Union of Biochemistry andMolecular Biology rules of nomenclature) or the function of the enzymeor protein represented by the genetic sequence, 2) the metabolic pathwaythat the protein represented by the genetic sequence participates in, 3)the substrate, product or structural role of the protein represented bythe genetic sequence, or, 4) the phenotype (e.g., an agronomic orpharmacological trait) affected by modulating expression or activity ofthe protein represented by the genetic sequence.

[0097] The machine of the present invention also includes an output fordisplaying, printing, or recording the results of the identification,analysis, or modeling performed using a genetic sequence of the presentinvention. Exemplary outputs include monitors, printers, or variouselectronic storage mechanisms (e.g., floppy disks, hard drives, mainmemory) which can be used to display the results or employed as a meansto input the stored data into a subsequent application or device.

[0098] In some embodiments, data representing a genetic sequence of thepresent invention is a data element within a data structure. The datastructure may be defined by the computer programs that define theprocesses of identification, modeling, or analysis (see below) or it maybe defined by the programming of separate data storage and retrievalprograms, subroutines or systems. Thus, the present invention provides amemory for storing a data structure that can be accessed by a computerprogrammed to implement a process for identification, analysis, ormodeling of a genetic sequence. The data structure, stored withinmemory, is associated with the data representing the genetic sequenceand reflects the underlying organization and structure of the geneticsequence to facilitate program access to data elements corresponding tological sub-components of the genetic sequence. The data structureenables the genetic sequence to be identified, analyzed, or modeled. Theunderlying order and structure of a genetic sequence is datarepresenting the higher order organization of the primary sequence. Suchhigher order structures affect transcription, translation, enzymekinetics, or reflects structural domains or motifs. Exemplary logicalsub-components which constitute the higher order organization of thegenetic sequence include but are not limited to: restriction enzymesites, endopeptidase sites, major grooves, minor grooves, beta-sheets,alpha helices, open reading frames (ORFs), 5′ untranslated regions(UTRs), 3′ UTRs, ribosome binding sites, glycosylation sites, signalpeptide domains, intron-exon junctions, poly-A tails, transcriptioninitiation sites, translation start sites, translation terminationsites, methylation sites, zinc finger domains, modified amino acidsites, preproprotein-proprotein junctions, proprotein-protein junctions,transit peptide domains, single nucleotide polymorphisms (SNPs), simplesequence repeats (SSRs), restriction fragment length polymorphisms(RFLPs), insertion elements, transmembrane spanning regions, andstem-loop structures.

[0099] In another embodiment, the present invention provides a dataprocessing system comprising at least one data structure in memory wherethe data structure supports the accession of data representing a geneticsequence of the present invention. The system also comprises at leastone genetic identification, analysis, or modeling program which directsthe execution of instructions by the system using the genetic sequencedata to identify, analyze, or model at least one data element which is alogical sub-component of the genetic sequence. An output for theprocessing results is also provided.

[0100] In another embodiment, the present invention provides a datastructure in a computer readable medium that contains data representinga genetic sequence of the present invention. The data structure isorganized to reflect the logical structuring of the genetic sequence, sothat the sequence can be analyzed by software programs capable ofaccessing the data structure. In particular, the data structures of thepresent invention organize the genetic sequences of the presentinvention in a manner which allows software tools to perform anidentification, analysis, or modeling using logical elements of eachgenetic sequence.

[0101] In a further embodiment, the present invention provides amachine-readable media containing a computer program and geneticsequence data. The program provides instructions sufficient to implementa process for effecting the identification, analysis, or modeling of thegenetic sequence data. The media also includes a data structurereflecting the underlying organization and structure of the data tofacilitate program access to data elements corresponding to logicalsub-components of the genetic sequence, the data structure beinginherent in the program and in the way in which the program organizesand accesses the data.

[0102] An example of a data structure resembles a layered hash table,where in one dimension the base content of the sequence is representedby a string of elements A, T, C, G and N. The direction from the 5′ endto the 3′ end is reflected by the order from the position 0 to theposition of the length of the string minus one. Such a string,corresponding to a nucleotide sequence of interest, has a certain numberof substrings, each of which is delimited by the string position of its5′ end and the string position of its 3′ end within the parent string.In a second dimension, each substring is associated with or pointed toone or multiple attribute fields. Such attribute fields containannotations to the region on the nucleotide sequence represented by thesubstring.

[0103] For example, a sequence under investigation is 520 bases long andrepresented by a string named SeqTarget. There is a minor groove in the5′ upstream non-coding region from position 12 to 38, which isidentified as a binding site for an enhancer protein HM-A, which in turnwill increase the transcription of the gene represented by SeqTarget.Here, the substring is represented as (12, 38) and has the followingattributes: [upstream uncoded], [minor groove], [HM-A binding] and[increase transcription upon binding by HM-A]. Similarly, other types ofinformation can be stored and structured in this manner, such asinformation related to the whole sequence, e.g., whether the sequence isa full length viral gene, a mammalian house keeping gene, an EST fromclone X, or information related to the 3′ down stream non-coding region,e.g., hairpin structure, and information related to various domains ofthe coding region, e.g., Zinc finger.

[0104] This data structure is an open structure and is robust enough toaccommodate newly generated data and acquired knowledge. Such astructure is also a flexible structure. It can be trimmed down to a 1-Dstring to facilitate data mining and analysis steps, such as clustering,repeat-masking, and HMM analysis. Meanwhile, such a data structure alsocan extend the associated attributes into multiple dimensions. Pointerscan be established among the dimensioned attributes when needed tofacilitate data management and processing in a comprehensive genomicsknowledgebase. Furthermore, such a data structure is object-oriented.Polymorphism can be represented by a family or class of sequenceobjects, each of which has an internal structure as discussed above. Thecommon traits are abstracted and assigned to the parent object, whereaseach child object represents a specific variant of the family or class.Such a data structure allows data to be efficiently retrieved, updatedand integrated by the software applications associated with the sequencedatabase and/or knowledgebase.

[0105] The present invention also provides a process of identifying,analyzing, or modeling data representing a genetic sequence of thepresent invention. The process comprises: 1) providing a machine havinga hardware or software implemented genetic sequence identification,modeling, or analysis program with data representing a genetic sequence,2) executing the program while granting it access to the geneticsequence data, and 3) displaying or outputting the results of theidentification, analysis, or modeling. Data structures made by theprocesses of the present invention and embodied within a computerreadable medium are also provided herein.

[0106] A further process of the present invention comprises providing amemory embodied with data representing a genetic sequence and developingwithin the memory a data structure associated with the data andreflecting the underlying organization and structure of the data tofacilitate program access to data elements corresponding to logicalsub-components of the sequence. A computer is programmed with a programcontaining instructions sufficient to implement the process foreffecting the identification, analysis, or modeling of the geneticsequence and the program is executed on the computer while granting theprogram access to the data and to the data structure within the memory.The program results are outputted.

[0107] Identification, analysis, and modeling programs are well known inthe art and available commercially. The program typically has at leastone application to: 1) identify the structural role or enzymaticfunction of the gene which the genetic sequence encodes or is translatedfrom, 2) analyzes and identifies higher order structures within thegenetic sequence or, 3) model the physico-chemical properties of agenetic sequence of the present invention in a particular environment.

[0108] Included amongst the modeling/analysis tools are methods to: 1)recognize overlapping sequences (e.g., from a sequencing project) with apolynucleotide of the present invention and create an alignment called a“contig” ; 2) identify restriction enzyme sites of a polynucleotide ofthe present invention; 3) identify the products of a T1 ribonucleasedigestion of a polynucleotide of the present invention; 4) identify PCRprimers with minimal self-complementarity; 5) compute pairwise distancesbetween sequences in an alignment, reconstruct phylogentic trees usingdistance methods, and calculate the degree of divergence of two proteincoding regions; 6) identify patterns such as coding regions,terminators, repeats, and other consensus patterns in polynucleotides ofthe present invention; 7) identify RNA secondary structure; 8) identifysequence motifs, isoelectric point, secondary structure, hydrophobicity,and antigenicity in polypeptides of the present invention; 9) translatepolynucleotides of the present invention and backtranslate polypeptidesof the present invention; and 10) compare two protein or nucleic acidsequences and identifying points of similarity or dissimilarity betweenthem.

[0109] Identification of the function/utility of a genetic sequence istypically achieved by comparative analysis to a gene/protein databaseand establishing the genetic sequence as a candidate homologue (i.e.,ortholog or paralog) of a gene/protein of known function/utility. Acandidate homologue has statistically significant probability of havingthe same biological function (e.g., catalyzes the same reaction, bindsto homologous proteins/nucleic acids, has a similar structural role) asthe reference sequence to which it is compared. Sequenceidentity/similarity is frequently employed as a criterion to identifycandidate homologues. In the same vein, genetic sequences of the presentinvention have utility in identifying homologs in animals or other plantspecies, particularly those in the family Gramineae such as, but notlimited to, sorghum, wheat, or rice. Function is frequently establishedon the basis of sequence identity/similarity.

[0110] Exemplary sequence comparison systems are provided for insequence analysis software such as those provided by the GeneticsComputer Group (Madison, Wis.) or InforMax (Bethesda, Md.), orIntelligenetics (Mountain View, Calif.). Optionally, sequence comparisonis established using the BLAST or GAP suite of programs. Generally, asmallest sum probability value (P(N)) of less than 0.1, oralternatively, less than 0.01, 0.001, 0.0001, or 0.00001 using the BLAST2.0 suite of algorithms under default parameters identifies the testsequence as a candidate homologue (i.e., an allele, ortholog, orparalog) of a reference sequence. Those of skill in the art willrecognize that a candidate homologue has an increased statisticalprobability of having the same or similar function as the gene/proteinrepresented by the test sequence.

[0111] The software/hardware for effecting identification, analysis, ormodeling can be produced independently or obtained from commercialsuppliers. Exemplary identification, analysis, and modeling tools areprovided in products such as InforMax's (Bethesda, Md.) Vector NTI Suite(Version 5.5), Intelligenetics' (Mountain View, Calif.) PC/Gene program,and Genetics Computer Group's (Madison, Wis.) Wisconsin Package (Version10.0); these tools, and the functions they perform, (as provided anddisclosed by the programs and accompanying literature) are incorporatedherein by reference.

EXAMPLES

[0112] The present invention is further defined in the followingExamples, in which parts and percentages are by weight and degrees areCelsius, unless otherwise stated. It should be understood that theseExamples, while indicating preferred embodiments of the invention, aregiven by way of illustration only and are not to limit the scope of theinvention. From the above discussion and these Examples, one skilled inthe art can ascertain the essential characteristics of this invention,and without departing from the spirit and scope thereof, can makevarious changes and modifications of the invention to adapt it tovarious usages and conditions. Thus, various modifications of theinvention in addition to those shown and described herein will beapparent to those skilled in the art from the foregoing description.Such modifications are also intended to fall within the scope of theappended claims.

[0113] The disclosure of all publications, patents, patent applications,and computer programs cited herein are hereby incorporated by referencein their entirety.

Example 1 Composition of cDNA Libraries; Isolation and Sequencing ofcDNA Clones

[0114] cDNA libraries representing mRNAs from various corn, Jerusalemartichoke, rice, soybean, and wheat tissues were prepared. Thecharacteristics of the libraries are described below. TABLE 2 cDNALibraries from Corn, Jerusalem Artichoke, Rice, Soybean, and WheatLibrary Tissue Clone cco1n Corn Cob of 67 Day Old Plants Grown in GreenHouse¹ cco1n.pk055.115 cco1n.pk077.o18 cen3n Corn Endosperm 20 DaysAfter Pollination¹ cen3n.pk0155.f12 he11 Jerusalem Artichoke Tuber atFilling Stage he11.pk0013.b1 p0010 Corn Log Phase Suspension CellsTreated With p0010.cbpaa44rb A23187² to Induce Mass Apoptosisp0010.cbpaa44rd p0010.cbpco75rb p0014 Corn Leaves 7 and 8 from PlantTransformed With p0014.ctusq39r G-protein Gene, C. heterostrophusResistant p0026 Corn Regenerating Callus 5 Days After Auxin Removalp0026.ccrbd22r p0083 Corn Whole Kernels 7 Days After Pollinationp0083.c1daz07r p0094 Corn Leaf Collars for the Ear Leaf (EL) and thep0094.cssth92ra Next Leaf Above and Below the EL¹ p0100 Corn CoenocyticEmbryo Sacs 4 Days After Pollination¹ p0100.cbaaj24r p0104 Corn RootsStage V5³, Infested With Corn Root Worm¹ p0104.cabad88rb p0107 CornWhole Kernels 7 Days After Pollination¹ p0107.cbcap19r p0119 CornV12-Stage^(3 Ear Shoot With Husk, Night Harvested) ¹ p0119.cmtne90rp0119.cmtnr87r:fis p0119.cmtoj48r:fis p0127 Corn Nucellus Tissue, 5 DaysAfter Silking¹ p0127.cntam18r p0127.cntar92r p0129 H08 Lazy MutantInternode Tissue p0129.c1mad36r:fis rca1n Rice Callus¹rca1n.pk007.p13:fis rds1c Rice Developing Seeds rds1c.pk005.c17:fisrds1c.pk007.e9:fis r10n Rice 15 Day Old Leaf¹ r10n.pk0039.b7:fisr10n.pk0063.e10 r10n.pk127.m10:fis r10n.pk136.o14 r1r6 Rice Leaf 15 DaysAfter Germination, 6 Hours After r1r6.pk0025.h9 Infection of StrainMagaporthe grisea 4360-R-62 r1r6.pk0074.e9 (AVR2-YAMO); Resistantr1r6.pk0083.e10:fis r1s2 Rice Leaf 15 Days After Germination, 2 HoursAfter r1s2.pk0022.d7 Infection of Strain Magaporthe grisea 4360-R-67(AVR2-YAMO); Susceptible r1s6 Rice Leaf 15 Days After Germination, 6Hours After r1s6.pk0059.b8 Infection of Strain Magaporthe grisea4360-R-67 (AVR2-YAMO); Susceptible rr1 Rice Root of Two Week OldDeveloping Seedling rr1.pk0004.a2 rr1.pk0043.f8 rs11n Rice 15-Day-OldSeedling¹ rs11n.pk013.14 sdp2c Soybean Developing Pods (6-7 mm)sdp2c.pk009.b13 sdp3c Soybean Developing Pods (8-9 mm)sdp3c.pk006.d23:fis sdp4c Soybean Developing Pods (10-12 mm)sdp4c.pk014.k19 se3 Soybean Embryo, 17 Days After Flowering se3.02c07se5 Soybean Embryo, 21 Days After Flowering se5.pk0029.d2 sf11 SoybeanImmature Flower sf11.pk128.a18:fis sgc2c Soybean Cotyledon 12-20 DaysAfter Germination sgs2c.pk004.h13 (Mature Green) sgc4c Soybean Cotyledon14-21 Days After Germination sgs4c.pk004.c18 (1/4 yellow) sic1c SoybeanRoot, Stem, and Leaf Tissue With Iron sic1c.pk001.e18:fis Chlorosis,Pooled s12 Soybean Two-Week-Old Developing Seedlings s12.pk121.m20:fisTreated With 2.5 ppm chlorimuron s1s2c Soybean Infected With Scierotiniascierotiorum s1s2c.pk005.m4:fis Mycelium s1s2c.pk037.c11 sr1 SoybeanRoot sr1.pk0073.f1 src1c Soybean 8 Day Old Root Infected With CystNematode src1c.pk001.a5:fis src2c Soybean 8 Day Old Root Infected WithCyst Nematode src2c.pk023.f15 src3c Soybean 8 Day Old Root Infected WithCyst Nematode src3c.pk012.d7 srm Soybean Root Meristem srm.pk0035.c1:fiswdk1c Wheat Developing Kernel, 3 Days After Anthesis wdk1c.pk012.n13:fiswdr1 Wheat Developing Root and Leaf wdr1.pk0005.f7:fis wkm2c WheatKernel Malted 175 Hours at 4 Degrees Celsius wkm2c.pk0002.a3 w11n WheatLeaf From 7 Day Old Seedling¹ w11n.pk0005.c8 w11n.pk0054.d8w11n.pk0095.f3:fis w1k4 Wheat Seedlings 4 Hours After Treatment WithHerbicide⁴ w1k4.pk0022.b7 w1m0 Wheat Seedlings 0 Hour After InoculationWith w1m0.pk0028.h3:fis Erysiphe graminis f.sp tritici w1m24 WheatSeedlings 24 Hours After Inoculation With w1m24.pk0018.g9 Erysiphegraminis f.sp tritici w1m96 Wheat Seedlings 96 Hours After InoculationWith w1m96.pk044.g9 Erysiphe graminis f.sp tritici w1mk1 Wheat Seedlings1 Hour After Inoculation With w1mk1.pk0001.g6:fis Erysiphe graminis f.sptritici and Treatment With Herbicide⁴ wr1 Wheat Root From 7 Day OldSeedling wr1.pk0067.h2 wr1.pk0076.a11 wr1.pk178.b5 wre1n Wheat Root From7 Day Old Etiolated Seedling¹ wre1n.pk0079.c6 wre1n.pk160.d1:fis

[0115] cDNA libraries may be prepared by any one of many methodsavailable. For example, the cDNAs may be introduced into plasmid vectorsby first preparing the cDNA libraries in Uni-ZAP™ XR vectors accordingto the manufacturer's protocol (Stratagene Cloning Systems, La Jolla,Calif.). The Uni-ZAP™ XR libraries are converted into plasmid librariesaccording to the protocol provided by Stratagene. Upon conversion, cDNAinserts is will be contained in the plasmid vector pBluescript. Inaddition, the cDNAs may be introduced directly into precut Bluescript IISK(+) vectors (Stratagene) using T4 DNA ligase (New England Biolabs),followed by transfection into DH10B cells according to themanufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts arein plasmid vectors, plasmid DNAs are prepared from randomly pickedbacterial colonies containing recombinant pBluescript plasmids, or theinsert cDNA sequences are amplified via polymerase chain reaction usingprimers specific for vector sequences flanking the inserted cDNAsequences. Amplified insert DNAs or plasmid DNAs are sequenced indye-primer sequencing reactions to generate partial cDNA sequences(expressed sequence tags or “ESTs”; see Adams et al., (1991) Science252:1651-1656). The resulting ESTs are analyzed using a Perkin ElmerModel 377 fluorescent sequencer.

[0116] Full-insert sequence (FIS) data is generated utilizing a modifiedtransposition protocol. Clones identified for FIS are recovered fromarchived glycerol stocks as single colonies, and plasmid DNAs areisolated via alkaline lysis. Isolated DNA templates are reacted withvector primed M13 forward and reverse oligonucleotides in a PCR-basedsequencing reaction and loaded onto automated sequencers. Confirmationof clone identification is performed by sequence alignment to theoriginal EST sequence from which the FIS request is made.

[0117] Confirmed templates are transposed via the Primer Islandtransposition kit (PE Applied Biosystems, Foster City, Calif.) which isbased upon the Saccharomyces cerevisiae Ty1 transposable element (Devineand Boeke (1994) Nucleic Acids Res. 22:3765-3772). The in vitrotransposition system places unique binding sites randomly throughout apopulation of large DNA molecules. The transposed DNA is then used totransform DH10B electro-competent cells (Gibco BRL/Life Technologies,Rockville, Md.) via electroporation. The transposable element containsan additional selectable marker (named DHFR; Fling and Richards (1983)Nucleic Acids Res. 11:5147-5158), allowing for dual selection on agarplates of only those subclones containing the integrated transposon.Multiple subclones are randomly selected from each transpositionreaction, plasmid DNAs are prepared via alkaline lysis, and templatesare sequenced (ABI Prism dye-terminator ReadyReaction mix) outward fromthe transposition event site, utilizing unique primers specific to thebinding sites within the transposon.

[0118] Sequence data is collected (ABI Prism Collections) and assembledusing Phred/Phrap (P. Green, University of Washington, Seattle).Phrep/Phrap is a public domain software program which re-reads the ABIsequence data, re-calls the bases, assigns quality values, and writesthe base calls and quality values into editable output files. The Phrapsequence assembly program uses these quality values to increase theaccuracy of the assembled sequence contigs. Assemblies are viewed by theConsed sequence editor (D. Gordon, University of Washington, Seattle).

Example 2 Identification of cDNA Clones

[0119] cDNA clones encoding respiratory burst oxidase homologs,methyltransferases, methylases, phospholipases, transcription factors,aminoacyl-tRNA synthetases, AP2 subunits, or EDS1 were identified byconducting BLAST (Basic Local Alignment Search Tool; Altschul et al.(1993) J. Mol. Biol. 215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/)searches for similarity to sequences contained in the BLAST “nr”database (comprising all non-redundant GenBank CDS translations,sequences derived from the 3-dimensional structure Brookhaven ProteinData Bank, the last major release of the SWISS-PROT protein sequencedatabase, EMBL, and DDBJ databases). The cDNA sequences obtained inExample 1 were analyzed for similarity to all publicly available DNAsequences contained in the “nr” database using the BLASTN algorithmprovided by the National Center for Biotechnology Information (NCBI).The DNA sequences were translated in all reading frames and compared forsimilarity to all publicly available protein sequences contained in the“nr” database using the BLASTX algorithm (Gish and States (1993) Nat.Genet. 3:266-272) provided by the NCBI. For convenience, the P-value(probability) of observing a match of a cDNA sequence to a sequencecontained in the searched databases merely by chance as calculated byBLAST are reported herein as “pLog” values, which represent the negativeof the logarithm of the reported P-value. Accordingly, the greater thepLog value, the greater the likelihood that the cDNA sequence and theBLAST “hit” represent homologous proteins.

[0120] ESTs submitted for analysis are compared to the genbank databaseas described above. ESTs that contain sequences more 5-prime or 3-primecan be found by using the BLASTN algorithm (Altschul et al (1997)Nucleic Acids Res. 25:3389-3402.) against the Du Pont proprietarydatabase comparing nucleotide sequences that share common or overlappingregions of sequence homology. Where common or overlapping sequencesexist between two or more nucleic acid fragments, the sequences can beassembled into a single contiguous nucleotide sequence, thus extendingthe original fragment in either the 5-prime or 3-prime direction. Oncethe most 5-prime EST is identified, its complete sequence can bedetermined by Full Insert Sequencing as described in Example 1.Homologous genes belonging to different species can be found bycomparing the amino acid sequence of a known gene (from either aproprietary source or a public database) against an EST database usingthe TBLASTN algorithm. The TBLASTN algorithm searches an amino acidquery against a nucleotide database that is translated in all 6 readingframes. This search allows for differences in nucleotide codon usagebetween different species, and for codon degeneracy.

Example 3 Characterization of cDNA Clones Encoding RbohA

[0121] The BLASTX search using the EST sequences from clones listed inTable 3 revealed similarity of the polypeptides encoded by the Contig torespiratory burst oxidase homolog A (RbohA) from Arabidopsis thaliana(NCBI General Identifier No. 3242781). Shown in Table 3 are the BLASTresults for individual ESTs (“EST”): TABLE 3 BLAST Results for SequencesEncoding Polypeptides Homologous to RbohA BLAST pLog Score Clone Status3242781 (Arabidopsis thaliana) p0010.cbpco75rb EST 46.40 r1r6.pk0025.h9EST 69.00 w11n.pk0005.c8 EST 53.00

[0122] The sequence of the entire cDNA insert in the clones listed inTable 3 was determined. The BLASTX search using the EST sequences fromclones listed in Table 4 revealed similarity of the polypeptides encodedby the Contig to RbohA from Arabidopsis thaliana (NCBI GeneralIdentifier No. 3242781) and by the by the Contig to RbohB fromArabidopsis thaliana (NCBI General Identifier No. 3242783). Shown inTable 4 are the BLAST results for the sequences of the entire cDNAinserts comprising the indicated cDNA clones (“FIS”): TABLE 4 BLASTResults for Sequences Encoding Polypeptides Homologous to Arabidopsisthaliana RbohA and RbohB BLAST pLog Score Clone Status 3242781 (RbohA)3242783 (RbohB) p0010.cbpco75rb:fis FIS 56.40 60.52 r1r6.pk0025.h9:fisFIS 63.00 59.70 w11n.pk0005.c8:fis FIS 54.22 51.70

[0123] The data in Table 5 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:2, 4, 6, 8,10, and 12 and the Arabidopsis thaliana RbohA and RbohB sequences (NCBIGeneral Identifier Nos. 3242781 and 3242783, respectively). TABLE 5Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences of cDNA Clones Encoding Polypeptides Homologous to Arabidopsisthaliana RbohA and RbohB Percent Identity to SEQ ID NO. 3242781 (RbohA)3242783 (RbohB) 2 57.5 55.2 4 83.6 75.0 6 79.5 73.0 8 60.0 62.4 10 82.576.6 12 80.6 75.8

[0124] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.) Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters(GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, and BLAST scoresand probabilities indicate that the nucleic acid fragments comprisingthe instant cDNA clones encode substantial portions of a corn, a rice,and a wheat respiratory burst oxidase homolog.

Example 4 Characterization of cDNA Clones Encoding RbohB

[0125] The BLASTX search using the EST sequences from clones listed inTable 6 revealed similarity of the polypeptides encoded by the cDNAs torespiratory burst oxidase homolog B (RbohB) from Arabidopsis thaliana(NCBI General Identifier No. 3242783). Shown in Table 6 are the BLASTresults for individual ESTs (“EST”): TABLE 6 BLAST Results for SequencesEncoding Polypeptides Homologous to RbohB BLAST pLog Score Clone Status3242783 (Arabidopsis thaliana) p0010.cbpaa44rd EST 86.00 r1s2.pk0022.d7EST 35.40 src2c.pk023.f15 EST 52.70 w11n.pk0054.d8 EST 35.00

[0126] The sequence of the entire cDNA insert in the rice, soybean, andwheat clones listed in Table 6 was determined. The BLASTX search usingthe EST sequences from clones listed in Table 7 revealed similarity ofthe polypeptides encoded by the cDNAs to RbohB and RbohD fromArabidopsis thaliana (NCBI General Identifier Nos. 3242783 and 3242789,respectively). Shown in Table 7 are the BLAST results for the sequencesof the entire cDNA inserts comprising the indicated cDNA clones (“FIS”):TABLE 7 BLAST Results for Sequences Encoding Polypeptides Homologous toArabidopsis thaliana RbohB and RbohD BLAST pLog Score Clone Status3242783 (RbohB) 3242789 (RbohD) r1s2.pk0022.d7:fis FIS 123.00 127.00src2c.pk023.f15:fis FIS 60.15 62.40 w11n.pk0054.d8:fis FIS 71.70 67.30

[0127] The data in Table 8 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:14, 16, 18,20, 22, 24, and 26 and the Arabidopsis thaliana RbohB and RbohDsequences (NCBI General Identifier Nos. 3242783 and 3242789,respectively). TABLE 8 Percent Identity of Amino Acid Sequences DeducedFrom the Nucleotide Sequences of cDNA Clones Encoding PolypeptidesHomologous to Arabidopsis thaliana RbohB and RbohD Percent Identity toSEQ ID NO. 3242783 (RbohB) 3242789 (RbohD) 14 60.5 58.7 16 73.7 69.7 1870.1 57.6 20 52.2 47.8 22 63.9 63.3 24 42.3 42.3 26 65.8 58.4

[0128] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn, a rice, asoybean, and a wheat RbohB.

Example 5 Characterization of cDNA Clones Encoding RbohC

[0129] The BLASTX search using the EST sequences from clones listed inTable 9 revealed similarity of the polypeptides encoded by the cDNAs torespiratory burst oxidase homolog C (RbohC) from Arabidopsis thaliana(NCBI General Identifier No.3242785). Shown in Table 9 are the BLASTresults for individual ESTs (“EST”): TABLE 9 BLAST Results for SequencesEncoding Polypeptides Homologous to RbohC BLAST pLog Score Clone Status3242785 (Arabidopsis thaliana) r1r6.pk0074.e9 EST 60.10

[0130] The sequence of the entire cDNA insert in the clone listed inTable 9 was determined. The BLASTX search using the EST sequences fromclones listed in Table 10 revealed similarity of the polypeptidesencoded by the cDNAs to RbohC from Arabidopsis thaliana (NCBI GeneralIdentifier No. 3242785). Shown in Table 10 are the BLAST results for thesequences of the entire cDNA insert comprising the indicated cDNA clone(“FIS”): TABLE 10 BLAST Results for Sequences Encoding PolypeptidesHomologous RbohC BLAST pLog Score Clone Status 3242785 (Arabidopsisthaliana) r1r6.pk0074.e9:fis FIS 64.00

[0131] The data in Table 11 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:28 and 30and the Arabidopsis thaliana sequence (NCBI General Identifier No.3242785). TABLE 11 Percent Identity of Amino Acid Sequences Deduced Fromthe Nucleotide Sequences of cDNA Clones Encoding Polypeptides Homologousto RbohC Percent Identity to SEQ ID NO. 3242785 (Arabidopsis thaliana)28 59.8 30 60.9

[0132] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a rice RbohC.

Example 6 Characterization of cDNA Clones Encoding RbohD

[0133] The BLASTX search using the EST sequences from clones listed inTable 12 revealed similarity of the polypeptides encoded by the cDNAs torespiratory burst oxidase homolog D (RbohD) from Arabidopsis thaliana(NCBI General Identifier No. 3242789). Shown in Table 12 are the BLASTresults for individual ESTs (“EST”), or for the sequences of contigsassembled from two or more ESTs (“Contig”): TABLE 12 BLAST Results forSequences Encoding Polypeptides Homologous to RbohD BLAST pLog ScoreClone Status 3242789 (Arabidopsis thaliana) Contig of: Contig 106.00cco1n.pk055.115 p0127.cntar92r rr1.pk0004.a2 EST 56.05 sr1.pk0073.f1 EST61.40 w1m96.pk044.g9 EST 41.00

[0134] The sequence of the entire cDNA insert in the rice, soybean, andwheat clones listed in Table 12 was determined. The BLASTX search usingthe EST sequences from clones listed in Table 13 revealed similarity ofthe polypeptides encoded by the cDNAs to RbohD from Arabidopsis thaliana(NCBI General Identifier No. 3242789). Shown in Table 13 are the BLASTresults for the sequences of the entire cDNA inserts comprising theindicated cDNA clones (“FIS”): TABLE 13 BLAST Results for SequencesEncoding Polypeptides Homologous to RbohD BLAST pLog Score Clone Status3242789(Arabidopsis thaliana) rr1.pk0004.a2:fis FIS >254.00sr1.pk0073.f1:fis FIS >254.00 w1m96.pk044.g9:fis FIS >254.00

[0135] The data in Table 14 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:32, 34, 36,38, 40, 42, and 44 and the Arabidopsis thaliana sequence (NCBI GeneralIdentifier No. 3242789). TABLE 14 Percent Identity of Amino AcidSequences Deduced From the Nucleotide Sequences of cDNA Clones EncodingPolypeptides Homologous to RbohD Percent Identity to SEQ ID NO. 3242789(Arabidopsis/thaliana) 32 64.5 34 75.8 36 63.5 38 51.0 40 73.7 42 66.144 71.1

[0136] Sequence alignments and percent identity calculations wereperformed using the Magalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.) Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn, a rice, asoybean, and a wheat RbohD.

Example 7 Characterization of cDNA Clones Encoding Respiratory BurstOxidase Protein (Rboh)

[0137] The BLASTX search using the EST sequences from clones listed inTable 15 revealed similarity of the polypeptides encoded by the cDNAs torespiratory burst oxidase homolog (Rboh) from Arabidopsis thaliana andOryza sativa (NCBI General Identifier Nos. 2654868 and 2654870respectively). Shown in Table 15 are the BLAST results for individualESTs (“EST”): TABLE 15 BLAST Results for Sequences Encoding PolypeptidesHomologous to Respiratory Burst Oxidase Protein NCBI BLAST pLog CloneStatus General Accession No. Score sdp2c.pk009.b13 EST 2654868(Arabidopsis thaliana) 50.70 p0104.cabad88rb EST 2654870 (Oryza sativa)93.70 rs11n.pk013.i4 EST 2654870 (Oryza sativa) 60.22

[0138] The sequence of the entire cDNA insert in the clones listed inTable 15 was determined. The BLASTX search using the EST sequences fromclones listed in Table 16 revealed similarity of the polypeptidesencoded by the cDNAs to respiratory burst oxidase protein fromArabidopsis thaliana and Oryza sativa (NCBI General Identifier Nos.7484893 and 7489460, respectively). The sequence having NCBI GeneralIdentifier No. 7484893 is 100% identical to the sequence having NCBIGeneral Identifier No. 2654868, and the sequence having NCBI GeneralIdentifier No. 7489460 is 100% identical to the sequence having NCBIGeneral Identifier No. 2654870. Shown in Table 16 are the BLAST resultsfor the sequences of the entire cDNA inserts comprising the indicatedcDNA clones (“FIS”): TABLE 16 BLAST Results for Sequences EncodingPolypeptides Homologous to Respiratory Burst Oxidase Protein BLAST pLogScore 7484893 7489460 Clone Status (A. thaliana) (O. sativa)p0104.cabad88rb:fis FIS >254.00 >254.00 rs11n.pk013.i4:fisFIS >254.00 >254.00 sdp2c.pk009.b13:fis FIS 72.52 68.00

[0139] The data in Table 17 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:46, 48, 50,52, 54, and 56 and the Arabidopsis thaliana and Oryza sativa sequences(NCBI General Identifier Nos. 7484893 and 7489460, respectively). TABLE17 Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences of cDNA Clones Encoding Polypeptides Homologous to RespiratoryBurst Oxidase Protein Percent Identity to SEQ ID NO. 7484893 (A.thaliana) 7489460 (O. sativa) 46 62.3 81.9 48 65.5 91.8 50 100.0 92.3 5275.5 93.7 54 73.7 91.7 56 88.8 83.9

[0140] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn, a rice, and asoybean respiratory burst oxidase protein.

Example 8 Characterization of cDNA Clones Encoding Respiratory BurstOxidase Homolog E (RbohE)

[0141] The BLASTX search using the EST sequences from clones listed inTable 18 revealed similarity of the polypeptides encoded by the cDNAs toRbohE from Arabidopsis thaliana (NCBI General Identifier No. 3242787).Shown in Table 18 are the BLAST results for individual ESTs (“EST”):TABLE 18 BLAST Results for Sequences Encoding Polypeptides Homologous toRbohE BLAST pLog Score Clone Status 3242787 (Arabidopsis thaliana)cen3n.pk0155.f12 EST 60.40 se3.02c07 EST 18.70 wr1.pk178.b5 EST 60.70

[0142] The sequence of the entire cDNA insert in the corn and wheatclones listed in Table 18 was determined. The BLASTX search using theEST sequences from clones listed in Table 19 revealed similarity of thepolypeptides encoded by the cDNAs to RbohE from Arabidopsis thaliana(NCBI General Identifier No. 3242787). Shown in Table 19 are the BLASTresults for the sequences of the entire cDNA inserts comprising theindicated cDNA clones (“FIS”): TABLE 19 BLAST Results for SequencesEncoding Polypeptides Homologous to RbohE BLAST pLog Score Clone Status3242787 (Arabidopsis thaliana) cen3n.pk0155.f12:fis FIS 155.00wr1.pk178.b5:fis FIS 139.00

[0143] The data in Table 20 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:58, 60, 62,64, and 66 and the Arabidopsis thaliana sequence (NCBI GeneralIdentifier No. 3242787). TABLE 20 Percent Identity of Amino AcidSequences Deduced From the Nucleotide Sequences of cDNA Clones EncodingPolypeptides Homologous to RbohE Percent Identity to SEQ ID NO. 3242787(Arabidopsis thaliana) 58 74.4 60 33.6 62 72.1 64 62.2 66 61.8

[0144] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode subsitantial portions of a corn, a soybean,and a wheat RbohE.

Example 9 Characterization of cDNA Clones Encoding RbohF

[0145] The BLASTX search using the EST sequences from clones listed inTable 21 revealed similarity of the polypeptides encoded by the cDNAs toRbohF from Arabidopsis thaliana (NCBI General Identifier No. 3242456).Shown in Table 21 are the BLAST results for individual ESTs (“EST”):TABLE 21 BLAST Results for Sequences Encoding Polypeptides Homologous toRbohF BLAST pLog Score Clone Status 3242456 (Arabidopsis thaliana)p0010.cbpaa44rb EST 61.00 sdp4c.pk014.k19 EST 22.10

[0146] The sequenced of the entire cDNA insert in the clones listed inTable 21 was determined. The BLASTX search using the EST sequences fromclones listed in Table 22 revealed similarity of the polypeptidesencoded by the cDNAs to phox homolog from Lycopersicon esculentum (NCBIGeneral Identifier No. 4585142) and to RbohF from Arabidopsis thaliana(NCBI General Identifier No. 7484893). There is one amino aciddifference (Thr to Ile at position 908) between the Arabidopsis thalianasequences having NCBI General Identifier Nos. 3242456 and 7484893. Shownin Table 22 are the BLAST results for the sequences of the entire cDNAinserts comprising the indicated cDNA clones (“FIS”): TABLE 22 BLASTResults for Sequences Encoding Polypeptides Homologous to RbohF BLASTpLog Score 4585142 7484893 Clone Status (L. esculentum) (A. thaliana)p0010.cbpaa44rb:fis FIS >254.00 >254.00 sdp4c.pk014.k19:fis FIS 34.4032.40

[0147] The data in Table 23 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:68, 70, 72,and 74 and the Lycopersicon esculentum and Arabidopsis thalianasequences (NCBI General Identifier Nos. 4585142 and 7484893,respectively). TABLE 23 Percent Identity of Amino Acid Sequences DeducedFrom the Nucleotide Sequences of cDNA Clones Encoding PolypeptidesHomologous to RbohF Percent Identity to SEQ ID NO. 4585142 (L.esculentum) 7484893 (A. thaliana) 68 50.8 52.5 70 88.9 77.8 72 59.1 58.674 73.1 69.2

[0148] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn and a soybeanRbohF.

Example 10 Characterization of cDNA Clones Encoding tRNA-mnm⁵s²U-MT

[0149] The BLASTX search using the EST sequences from clones listed inTable 24 revealed similarity of the polypeptides encoded by the cDNAs totRNA-mnm⁵s²U-MT from Borrelia burgdorferi (NCBI General Identifier No.2688619). Shown in Table 24 are the BLAST results for individual ESTs(“EST”): TABLE 24 BLAST Results for Sequences Encoding PolypeptidesHomologous to tRNA-mnm⁵s²U-MT BLAST pLog Score Clone Status 2688619(Borrelia burgdorferi) cco1n.pk077.o18 EST 29.70 se5.pk0029.d2 EST 11.10

[0150] The sequence of the entire cDNA insert in the clones listed inTable 24 was determined. The BLASTX search using the EST sequences fromclones listed in Table 25 revealed similarity of the polypeptidesencoded by the Contigs to a conserved hypothetical protein from Borreliaburgdorferi (NCBI General Identifier No. 2688619) and to a protein withsimilarities to tRNA-mnm⁵s²U-MT from Arabidopsis thaliana (NCBI GeneralIdentifier No. 4836940). Shown in Table 25 are the BLAST results for thesequences of the entire cDNA inserts comprising the indicated cDNAclones (“FIS”): TABLE 25 BLAST Results for Sequences EncodingPolypeptides Homologous to tRNA-mnm⁵s²U-MT BLAST pLog Score Clone Status2688619 4836940 cco1n.pk077.o18:fis FIS 67.70 127.00 se5.pk0029.d2:fisFIS 94.40 >254.00

[0151] The data in Table 26 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:76, 78, 80,and 82 and the Borrelia burgdorferi and Arabidopsis thaliana sequences(NCBI General Identifier Nos. 2688619 and 4836940. respectively). TABLE26 Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences of cDNA Clones Encoding Polypeptides Homologous totRNA-mnm⁵s²U-MT Percent Identity to SEQ ID NO. 2688619 4836940 76 44.469.4 78 34.9 77.1 80 34.2 65.2 82 41.4 80.9

[0152] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn and a soybeantRNA-mnm⁵s²U-MT.

Example 11 Characterization of cDNA Clones Encoding Chromomethylase

[0153] The BLASTX search using the EST sequences from clones listed inTable 27 revealed similarity of the polypeptides encoded by the contigsto chromomethylase from Arabidopsis thaliana (NCBI General IdentifierNos. 2865416 and 2865422) and from Arabidopsis arenosa (NCBI GeneralIdentifier No. 2766715). Shown in Table 27 are the BLAST results forindividual ESTs (“EST”), or for the sequences of the entire cDNA insertscomprising the indicated cDNA clones (“FIS”): TABLE 27 BLAST Results forSequences Encoding Polypeptides Homologous to Chromomethylase BLAST pLogScore 2865416 2865422 2766715 Clone Status (A. thaliana) (A. thaliana)(A. arenosa) he11.pk0013.b1 FIS >254.00 >254.00 >254.00 p0094.cssth92raEST 32.15 31.22 32.40 r10n.pk136.o14 EST 10.70 10.52 10.40w11n.pk0095.f3 FIS 73.70 72.70 71.70 w1m0.pk0028.h3 FIS 9.40 9.40 3.30

[0154] The sequence of the entire CDNA insert in the clones listed inTable 27 was determined. The BLASTX search using the EST sequences fromclones listed in Table 28 revealed similarity of the polypeptidesencoded by the Contig to a putative chromomethylase from Arabidopsisthaliana (NCBI General Identifier No. 6665556) and by cDNAs tochromomethylases from Arabidopsis thaliana (NCBI General Identifier Nos.2865422 and 2865416). Shown in Table 28 are the BLAST results for thesequences of the entire cDNA inserts comprising the indicated cDNAclones (“FIS”), or for the sequences of FISs encoding the entire protein(“CGS”): TABLE 28 BLAST Results for Sequences Encoding PolypeptidesHomologous to Chromomethylase BLAST pLog Score 6665556 2865422 2865416Clone Status (A. thaliana) (A. thaliana) (A. thaliana)he11.pk0013.b1:fis CGS >254.00 >254.00 p0094.cssth92ra:fis FIS 68.0057.22 58.15 r10n.pk136.o14:fis FIS 57.15 41.40 41.30 srm.pk0035.c1:fisFIS 115.00 114.00 113.00

[0155] The data in Table 29 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:84, 86, 88,90, 92, 94, 96, 98, and 100 and the Arabidopsis thaliana sequences (NCBIGeneral Identifier Nos. 6665556, 2865422, and 2865416). TABLE 29 PercentIdentity of Amino Acid Sequences Deduced From the Nucleotide Sequencesof cDNA Clones Encoding Polypeptides Homologous to ChromomethylasePercent Identity to 6665556 2865422 2865416 SEQ ID NO. (A. thaliana) (A.thaliana) (A. thaliana) 84 49.2 46.7 46.7 86 43.5 38.0 38.6 88 21.3 23.423.4 90 50.0 56.5 56.5 92 57.2 49.6 50.0 94 46.7 45.1 45.1 96 54.2 46.647.1 98 45.1 36.5 36.5 100 57.6 55.2 55.2

[0156] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.) Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of an artichoke, a corn,a rice, and two wheat chromomethylases and an entire artichokechromomethylase.

Example 12 Characterization of cDNA Clones Encoding Cytosine5-Methyltransferase

[0157] The BLASTX search using the EST sequences from clones listed inTable 30 revealed similarity of the polypeptides encoded by the cDNAs tocytosine 5-methyltransferase from Lycopersicon esculentum, Homo sapiens,Pisum sativum, or Schizosaccharomyces pombe (NCBI General IdentifierNos. 2887280, 4758184, 2654108, and 730347). Shown in Table 30 are theBLAST results for individual ESTs (“EST”), or for the sequences of theentire cDNA inserts comprising the indicated cDNA clones (“FIS”): TABLE30 BLAST Results for Sequences Encoding Polypeptides Homologous toCytosine 5-Methyltransferase BLAST Clone Status NCBI General IdentifierNo. pLog Score p0100.cbaaj24r EST 2887280 (L. esculentum) 78.70rr1.pk0043.f8 EST 4758184 (Homo sapiens) 12.70 sgs2c.pk004.h13 EST2654108 (Pisum sativum) 105.00 wr1.pk0076.a11 EST 2887280 (L.esculentum) >254.00 wre1n.pk0079.c6 EST 730347 (S. pombe) 17.22

[0158] A corn sequence with similarities to cytosine5-methyltransferases is found in the NCBI database having GeneralIdentifier No. 7489814. The sequence of the entire cDNA insert in therice, soybean, and wheat clones listed in Table 30 was determined. TheBLASTX search using the EST sequences from clones listed in Table 31revealed similarity of the polypeptides encoded by the cDNAs to cytosine5-methyltransferase from Homo sapiens, Pisum sativum, Zea mays, or Musmusculus (NCBI General Identifier Nos. 4758184, 7488824, 7489814, and6753660, respectively). Shown in Table 31 are the BLAST results for thesequences of the entire cDNA inserts comprising the indicated cDNAclones (“FIS”): TABLE 31 BLAST Results for Sequences EncodingPolypeptides Homologous to Cytosine 5-Methyltransferase NCBI GeneralBLAST Clone Status Identifier No. pLog Score rr1.pk0043.f8:fis FIS4758184 (Homo sapiens) 12.70 sgs2c.pk004.h13:fis FIS 7488824 (Pisumsativum) >254.00 wr1.pk0076.a11:fis FIS 7489814 (Zea mays) 180.00wre1n.pk0079.c6:fis FIS 6753660 (Mus musculus) 63.52

[0159] The data in Table 32 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:102, 104,106, 108, 110, 112, 114, 116, and 118 and the Homo sapiens, Pisumsativum, Zea mays, or Mus musculus sequences (NCBI General IdentifierNos. 4758184, 7488824, 7489814, and 6753660). TABLE 32 Percent Identityof Amino Acid Sequences Deduced From the Nucleotide Sequences Sequencesof cDNA Clones Encoding Polypeptides Homologous to Cytosine5-Methyltransferase Percent Identity to 4758184 7488824 7489814 6753660SEQ ID NO. (H. sapiens) (P. sativum) (Z. mays) (M. musculus) 102 14.377.1 97.1 14.9 104 39.8 21.7 20.5 39.8 106 19.9 88.1 77.8 16.5 108 13.881.5 92.2 12.5 110 13.8 81.5 92.2 12.5 112 37.1 22.5 19.1 37.1 114 13.891.2 82.8 13.2 116 13.6 80.5 91.3 12.4 118 33.7 12.1 12.1 35.3

[0160] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.) Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of corn, rice, soybean,and wheat cytosine 5-methyltransferases.

Example 13 Characterization of cDNA Clones Encoding Phospholipase Dα

[0161] The BLASTX search using the EST sequences from clones listed inTable 33 revealed similarity of the polypeptides encoded by the cDNAs toPhospholipase Dα (PLDα) from Vigna unguiculata and Zea mays (NCBIGeneral Identifier Nos. 3914359 and 2499708, respectively). Shown inTable 33 are the BLAST results for individual ESTs (“EST”): TABLE 33BLAST Results for Sequences Encoding Polypeptides Homologous toPhospholipase Dα BLAST Clone Status NCBI General Identifier No. pLogScore sgs4c.pk004.c18 EST 3914359 (Vigna unguiculata) 76.00wlk4.pk0022.b7 EST 2499708 (Zea mays) 15.52

[0162] The sequence of the entire cDNA insert in the clones listed inTable 33 was determined. The BLASTP search using the amino acidsequences derived from clones listed in Table 34 revealed similarity ofthe polypeptides encoded by the cDNAs to PLD α from Vigna unguiculataand Oryza sativa (NCBI General Identifier Nos. 3914359 and 2499709,respectively). Shown in Table 34 are the BLAST results for the aminoacid sequence of the entire protein derived from the sequences of theentire CDNA insert comprising the indicated CDNA clones(“CGS”): TABLE 34BLAST Results for Sequences Encoding Polypeptides Homologous toPhospholipase Dα NCBI BLAST Clone Status General Identifier No. pLogScore sfl1.pk128.a18:fis CGS 3914359 (Vigna >254.00 unguiculata)wlk4.pk0022.b7:fis CGS 2499709 (Oryza sativa) >254.00

[0163] The data in Table 35 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:120, 122,124, and 126 and the Vigna unguiculata and Oryza sativa sequences (NCBIGeneral Identifier Nos. 3914359 and 2499709, respectively). TABLE 35Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences of cDNA Clones Encoding Polypeptides Homologous toPhospholipase Dα Percent Identity to SEQ ID NO. 3914359 (V. unguiculata)2499709 (Oryza sativa) 120 87.2 67.7 121 36.2 43.6 122 90.1 79.5 12479.0 89.7

[0164] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant CDNA clones encode a substantial portion and an entire soybeanand wheat phospholipase D(Xs.

Example 14 Characterization of cDNA Clones Encoding Phospholipase Dγ

[0165] The BLASTX search using the EST sequences from clones listed inTable 36 revealed similarity of the polypeptides encoded by the cDNAs toPhospholipase Dγ from Arabidopsis thaliana (NCBI General Identifier No.2653885). Shown in Table 36 are the BLAST results for individual ESTs(“EST”): TABLE 36 BLAST Results for Sequences Encoding PolypeptidesPolypeptides to Phospholipase Dγ BLAST pLog Score Clone Status 2653885(Arabidopsis thaliana) p0083.cldaz07r EST 48.52 src3c.pk012.d7 EST 41.00

[0166] The sequence of the entire cDNA insert in the clones listed inTable 36 was determined. The BLASTP search using the amino acidsequences derived from clones listed in Table 37 revealed similarity ofthe polypeptides encoded by the Contig to phospholipase D fromArabidopsis thaliana (NCBI General Identifier No. 1871182) and by cDNAsto Phospholipase Dγ from Nicotiana tabacum or Gossypium hirsutum (NCBIGeneral Identifier Nos. 6180159 and 5442428, respectively). Shown inTable 37 are the BLAST results for the sequences encoded by the entirecDNA inserts comprising the indicated cDNA clones (“FIS”), or by thesequences of the entire protein encoded by the indicated FIS(“CGS”):TABLE 37 BLAST Results for Sequences Encoding Polypeptides HomologousPolypeptides to Phospholipase Dγ BLAST pLog Score 6180159 18711825442428 (N. (A. (G. Clone Status tabacum) thaliana) hirsutum)p0083.cldaz07r:fis FIS 54.05 52.22 src3c.pk012.d7:fis CGS >254.00>254.00

[0167] The data in Table 38 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:128, 130,132, and 134 and the Nicotiana tabacum and Gossypium hirsutum sequences(NCBI General Identifier Nos. 6180159 and 5442428, respectively). TABLE38 Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences Sequences of cDNA Clones Encoding Polypeptides Homologous toPhospholipase Dγ Percent Identity to SEQ ID NO. 6180159 (N. tabacum)5442428 (G. hirsutum) 128 78.4 77.6 130 11.3 54.0 132 79.2 76.0 134 72.669.1

[0168] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portion of a corn PhospholipaseDγ and a substantial portion and an entire soybean Phospholipase Dγ.

Example 15 Characterization of cDNA Clones Encoding TF IIF α Subunit

[0169] The BLASTX search using the EST sequences from clone listed inTable 39 revealed similarity of the polypeptides encoded by the cDNAs totranscription factor IIF α subunit (TF IIF α subunit) from Xenopuslaevis (NCBI General Identifier No. 464522). Shown in Table 39 are theBLAST results for individual ESTs (“EST”): TABLE 39 BLAST Results forSequences Encoding Polypeptides Homologous to TF IIF α Subunit BLASTpLog Score Clone Status 464522 (Xenopus laevis) p0026.ccrbd22r EST 5.00

[0170] The sequence of the entire cDNA insert in the clone listed inTable 39 was determined. The BLASTP search using the amino acidsequences derived from clone listed in Table 40 revealed similarity ofthe polypeptides encoded by the Contig to a putative protein withsimilarities to TF IIF α subunit from Arabidopsis thaliana (NCBI GeneralIdentifier No. 5823572) and by the cDNAs to TF IIF ax subunit fromXenopus laevis (NCBI General Identifier No. 464522). Shown in Table 40are the BLAST results for the amino acid sequences derived from theentire cDNA inserts comprising the indicated cDNA clone (“FIS”): TABLE40 BLAST Results for Sequences Encoding Polypeptides Homologous to TFIIF α Subunit BLAST pLog Score Clone Status 464522 (Xenopus laevis)p0026.ccrbd22r:fis FIS 22.00

[0171] The data in Table 41 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs: 136 and138 and the Xenopus laevis and Arabidopsis thaliana sequences (NCBIGeneral Identifier Nos. 464522 and 5823572, respectively). TABLE 41Percent Identity of Amino Acid Sequences Deduced From the NucleotideSequences of cDNA Clones Encoding Polypeptides Homologous to TF IIF αSubunit Percent Identity to SEQ ID NO. 464522 (Xenopus laevis) 5823572(A. thaliana) 136 22.9 65.1 138 17.2 55.8

[0172] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn TF IIF αsubunit.

Example 16 Characterization of cDNA Clones Encoding TF IIF β Subunits

[0173] The BLASTX search using the EST sequences from clones listed inTable 42 revealed similarity of the polypeptides encoded by the cDNAs toTF IIF β subunit from Schizosaccharomyces pombe (NCBI General IdentifierNo. 4049502). Table 42 are the BLAST results for individual ESTs(“EST”): TABLE 42 BLAST Results for Sequences Encoding PolypeptidesHomologous to TF IIF β Subunit BLAST pLog Score Clone Status 4049502(Schizosaccharomyces pombe) p0014.ctusq39r EST 11.70 w1m24.pk0018.g9 EST9.30

[0174] The sequence of the entire cDNA insert in the clones listed inTable 42 was determined. Further sequencing and searching of the DuPontproprietary database allowed the identification of other corn and riceclones encoding TF IIF β subunit. The BLASTX search using the ESTsequences from clones listed in Table 43 revealed similarity of thepolypeptides encoded by the cDNAs to TF IIF β subunit fromSchizosaccharomyces pombe (NCBI General Identifier No. 7493495). Theamino acid sequences having NCBI General Identifier No. 4049502 and NCBIGeneral Identifier No. 7493495 are 100% identical. Shown in Table 43 arethe BLAST results for the sequences of the entire cDNA insertscomprising the indicated cDNA clones (“FIS”), or for the sequences ofcontigs assembled from an FIS and one or more ESTs (“Contig”): TABLE 43BLAST Results for Sequences Encoding Polypeptides Homologous to TF IIF βSubunit BLAST pLog Score Clone Status 7493495 (Schizosaccharomycespombe) Contig of: Contig 15.30 p0014.ctusq39r:fis p0107.cbcap19rrca1n.pk007.p13:fis FIS 12.15 r10n.pk0063.e10:fis FIS 18.70r1s6.pk0059.b8:fis FIS 18.22 w1m24.pk0018.g9:fis FIS 10.70

[0175] The data in Table 44 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:140, 142,144, 146, 148, 150, and 152 and the Schizosaccharomyces pombe sequence(NCBI General Identifier No. 7493495). TABLE 44 Percent Identity ofAmino Acid Sequences Deduced From the Nucleotide Sequences of cDNAClones Encoding Polypeptides Homologous to TF IIF β Subunit PercentIdentity to SEQ ID NO. 7493495 (Schizosaccharomyces pombe) 140 38.4 14245.6 144 24.9 146 34.5 148 23.2 150 21.7 152 42.9

[0176] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode substantial portions of a corn TF IIF βsubunit.

Example 17 Characterization of cDNA Clones Encoding Asparaginyl-tRNASynthetase

[0177] The BLASTX search using the EST sequences from clones listed inTable 45 revealed similarity of the polypeptides encoded by the cDNAs toasparaginyl-tRNA synthetases from Arabidopsis thaliana (NCBI GeneralIdentifier No. 2664210). Shown in Table 45 are the BLAST results forindividual ESTs (“EST”), for the sequences of the entire cDNA insertscomprising the indicated cDNA clones (“FIS”), or for FISs encoding theentire protein (“CGS”): TABLE 45 BLAST Results for Sequences EncodingPolypeptides Homologous to Asparaginyl-tRNA Synthetase BLAST pLog ScoreClone Status 2664210 (Arabidopsis thaliana) p0119.cmtne90r:fis CGS130.00 r10n.pk0039.b7:fis FIS 141.00 src1c.pk001.a5:fis CGS >254.00wdr1.pk0005.f7:fis FIS 24.70 wr1.pk0067.h2 EST 20.30

[0178] The data in Table 46 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:154, 156,158, 160, and 162 and the Arabidopsis thaliana sequence (NCBI GeneralIdentifier No. 2664210). TABLE 46 Percent Identity of Amino AcidSequences Deduced From the Nucleotide Sequences of cDNA Clones EncodingPolypeptides Homologous to Asparaginyl-tRNA Synthetase Percent Identityto SEQ ID NO. 2664210 (Arabidopsis thaliana) 154 44.0 156 86.4 158 72.4160 87.7 162 36.7

[0179] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode a substantial portion of one rice and twowheat asparaginyl-tRNA synthetase, one entire corn, and one entiresoybean asparaginyl-tRNA synthetase.

Example 18 Characterization of cDNA Clones Encoding Glutaminyl-tRNASynthetase

[0180] The BLASTX search using the EST sequences from clones listed inTable 47 revealed similarity of the polypeptides encoded by the cDNAs toglutaminyl-tRNA synthetase from Lupinus luteus (NCBI General IdentifierNo. 3915866). Shown in Table 47 are the BLAST results for the sequencesof the entire cDNA inserts comprising the indicated cDNA clones (“FIS”):TABLE 47 BLAST Results for Sequences Encoding Polypeptides Homologous toGlutaminyl-tRNA Synthetase BLAST pLog Score Clone Status 3915866(Lupinus luteus) p0129.c1mad36r:fis FIS >254.00 rds1c.pk007.e9:fisFIS >254.00 sic1c.pk001.e18:fis FIS 61.15 w1mk1.pk0001.g6:fis FIS>254.00

[0181] The data in Table 48 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:164, 166,168, and 170 and the Lupinus luteus sequence (NCBI General IdentifierNo. 3915866). TABLE 48 Percent Identity of Amino Acid Sequences DeducedFrom the Nucleotide Sequences of cDNA Clones Encoding PolypeptidesHomologous to Glutaminyl-tRNA Synthetase Percent Identity to SEQ ID NO.3915866 (Lupinus luteus) 164 76.9 166 80.0 168 92.0 170 77.0

[0182] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode a substantial portion of a corn, a rice, asoybean, and a wheat glutaminyl-tRNA synthetase.

Example 19 Characterization of cDNA Clones Encoding EDS1

[0183] The BLASTX search using the EST sequences from clones listed inTable 49 revealed similarity of the polypeptides encoded by the cDNAs toEDS1 from Arabidopsis thaliana (NCBI General Identifier No. 4454567).Shown in Table 49 are the BLAST results for the sequences of the entireCDNA inserts comprising the indicated CDNA clones (“FIS”), or thesequences of FISs encoding the entire protein (“CGS”): TABLE 49 BLASTResults for Sequences Encoding Polypeptides Homologous to EDS1 BLASTpLog Score Clone Status 4454567 (Arabidopsis thaliana)r10n.pk127.m10:fis FIS 63.30 s1s2c.pk037.c11:fis CGS 126.00wre1n.pk160.d1:fis FIS 87.52

[0184] The data in Table 50 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:172, 174,and 176 and the Arabidopsis thaliana sequence (NCBI General IdentifierNo. 4454567). TABLE 50 Percent Identity of Amino Acid Sequences DeducedFrom the Nucleotide Sequences of cDNA Clones Encoding PolypeptidesHomologous to EDS1 Percent Identity to SEQ ID NO. 4454567 (Arabidopsisthaliana) 172 34.6 174 37.4 176 37.4

[0185] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant CDNA clones encode a substantial portion of a rice and a wheatEDS1 and an entire soybean EDS1.

Example 20 Characterization of cDNA Clones Encoding AP50

[0186] The BLASTX search using the EST sequences from clones listed inTable 51 revealed similarity of the polypeptides encoded by the cDNAs toAP50 from Arabidopsis thaliana (NCBI General Identifier No. 2271477).Shown in Table 51 are the BLAST results for individual ESTs (“EST”), forthe sequences of the entire cDNA inserts comprising the indicated cDNAclones (“FIS”), or for the sequences of FISs encoding an entire protein(“CGS”): TABLE 51 BLAST Results for Sequences Encoding PolypeptidesHomologous to AP50 BLAST pLog Score Clone Status 2271477 (Arabidopsisthaliana) p0127.cntam18r EST 79.15 r1r6.pk0083.e10:fis FIS 81.40sdp3c.pk006.d23:fis CGS >254.00 wdk1c.pk012.n13:fis FIS 35.15

[0187] The data in Table 52 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:178, 180,182, and 184 and the Arabidopsis thaliana sequence (NCBI GeneralIdentifier No. 2271477). TABLE 52 Percent Identity of Amino AcidSequences Deduced From the Nucleotide Sequences of cDNA Clones EncodingPolypeptides Homologous to AP50 Percent Identity to SEQ ID NO. 2271477(Arabidopsis thaliana) 178 80.0 180 88.9 182 94.3 184 88.5

[0188] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode a substantial portion of a corn, a rice, anda wheat AP50 and an entire soybean AP50.

Example 21 Characterization of cDNA Clones Encoding Alpha Adaptin

[0189] The BLASTX search using the EST sequences from clones listed inTable 53 revealed similarity of the polypeptides encoded by the cDNAs toalpha adaptin from Mus musculus or Drosophila melanogaster (NCBI GeneralIdentifier No. 6671561 and 7296210, respectively). Shown in Table 53 arethe BLAST results for the sequences of the entire cDNA insertscomprising the indicated cDNA clones (“FIS”), or for the sequences ofFISs encoding an entire protein (“CGS”): TABLE 53 BLAST Results forSequences Encoding Polypeptides Homologous to Alpha Adaptin NCBI BLASTClone Status General Identifier No. pLog Score p0119.cmtoj48r:fis CGS6671561 (Mus musculus) >254.00 s12.pk121.m20:fis FIS 7296210 (D.melanogaster) 29.00

[0190] The data in Table 54 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:186 and 188and the Mus musculus and Drosophila melanogaster sequences (NCBI GeneralIdentifier No. 6671561 and 7296210, respectively). TABLE 54 PercentIdentity of Amino Acid Sequences Deduced From the Nucleotide Sequencesof cDNA Clones Encoding Polypeptides Homologous to Alpha Adaptin PercentIdentity to SEQ ID NO. 6671561 (Mus musculus) 7296210 (D. melanogaster)186 31.5 35.1 188 18.2 19.6

[0191] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode a substantial portion of a soybean and anentire corn alpha adaptin.

Example 22 Characterization of cDNA Clones Encoding Beta' Adaptin

[0192] The BLASTX search using the EST sequences from clones listed inTable 55 revealed similarity of the polypeptides encoded by the cDNAs tobeta' adaptin from Arabidopsis thaliana, Drosophila melanogaster, and/orHomo sapiens (NCBI General Identifier Nos. 7441349, 481762, and 1532118,respectively). Shown in Table 55 are the BLAST results for individualESTs (“EST”), for the sequences of the entire cDNA inserts comprisingthe indicated cDNA clones (“FIS”), or for the sequences of FISs encodingan entire protein (“CGS”): TABLE 55 BLAST Results for Sequences EncodingPolypeptides Homologous to Beta′ Adaptin BLAST pLog Score 481762 15321187441349 (D. melano- (Homo Clone Status (A. thaliana) gaster) sapiens)p0119.cmtnr87r:fis CGS >254.00 >254.00 >254.00 rds1c.pk005.c17:fisFIS >254.00 176.00 174.00 sls2c.pk005.m4:fis FIS 113.00 111.00wkm2c.pk0002.a3 EST 11.40 15.15

[0193] The data in Table 56 presents a calculation of the percentidentity of the amino acid sequences set forth in SEQ ID NOs:190, 192,194, and 196 and the Arabidopsis thaliana, Drosophila melanogaster, andHomo sapiens sequence (NCBI General Identifier Nos. 7441349, 481762, and1532118, respectively). TABLE 56 Percent Identity of Amino AcidSequences Deduced From the Nucleotide Sequences of cDNA Clones EncodingPolypeptides Homologous to Beta′ Adaptin Percent Identity to 7441349481762 1532118 SEQ ID NO. (A. thaliana) (D. melanogaster) (Homo sapiens)190 79.2 47.4 47.6 192 79.5 49.0 49.8 194 43.1 46.0 45.3 196 69.0 31.937.9

[0194] Sequence alignments and percent identity calculations wereperformed using the Megalign program of the LASERGENE bioinformaticscomputing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of thesequences was performed using the Clustal method of alignment (Higginsand Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAPPENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwisealignments using the Clustal method were KTUPLE 1, GAP PENALTY=-3,WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments, BLAST scores andprobabilities indicate that the nucleic acid fragments comprising theinstant cDNA clones encode a substantial portion of a rice, a soybean,and a wheat beta' adaptin and an entire corn beta' adaptin.

Example 23 Expression of Chimeric Genes in Monocot Cells

[0195] A chimeric gene comprising a cDNA encoding the instantpolypeptides in sense orientation with respect to the maize 27 kD zeinpromoter that is located 5′ to the cDNA fragment, and the 10 kD zein 3′end that is located 3′ to the cDNA fragment, can be constructed. ThecDNA fragment of this gene may be generated by polymerase chain reaction(PCR) of the cDNA clone using appropriate oligonucleotide primers.Cloning sites (NcoI or SmaI) can be incorporated into theoligonucleotides to provide proper orientation of the DNA fragment wheninserted into the digested vector pML 103 as described below.Amplification is then performed in a standard PCR. The amplified DNA isthen digested with restriction enzymes NcoI and SmaI and fractionated onan agarose gel. The appropriate band can be isolated from the gel andcombined with a 4.9 kb NcoI-SmaI fragment of the plasmid pML 103.Plasmid pML 103 has been deposited under the terms of the BudapestTreaty at ATCC (American Type Culture Collection, 10801 UniversityBlvd., Manassas, Va. 20110-2209), and bears accession number ATCC 97366.The DNA segment from pML103 contains a 1.05 kb SalI-NcoI promoterfragment of the maize 27 kD zein gene and a 0.96 kb SmaI-SalI fragmentfrom the 3′ end of the maize 10 kD zein gene in the vector pGem9Zf(+)(Promega; Madison, Wis.). Vector and insert DNA can be ligated at 15° C.overnight, essentially as described (Maniatis). The ligated DNA may thenbe used to transform E. coli XL1-Blue (Epicurian Coli XL-l Blue™;Stratagene, La Jolla, Calif.). Bacterial transformants can be screenedby restriction enzyme digestion of plasmid DNA and limited nucleotidesequence analysis using the dideoxy chain termination method (Sequenase™DNA Sequencing Kit; U.S. Biochemical). The resulting plasmid constructwould comprise a chimeric gene encoding, in the 5′ to 3′ direction, themaize 27 kD zein promoter, a cDNA fragment encoding the instantpolypeptides, and the 10 kD zein 3′ region.

[0196] The chimeric gene described above can then be introduced intocorn cells by the following procedure. Immature corn embryos can bedissected from developing caryopses derived from crosses of the inbredcorn lines H99 and LH132. The embryos are isolated 10 to 11 days afterpollination when they are 1.0 to 1.5 mm long. The embryos are thenplaced with the axis-side facing down and in contact withagarose-solidified N6 medium (Chu et al. (1975) Sci. Sin. Peking18:659-668). The embryos are kept in the dark at 27° C. Friableembryogenic callus consisting of undifferentiated masses of cells withsomatic proembryoids and embryoids borne on suspensor structuresproliferates from the scutellum of these immature embryos. Theembryogenic callus isolated from the primary explant can be cultured onN6 medium and sub-cultured on this medium every 2 to 3 weeks.

[0197] The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag,Frankfurt, Germany) may be used in transformation experiments in orderto provide for a selectable marker. This plasmid contains the Pat gene(see European Patent Publication 0 242 236) which encodesphosphinothricin acetyl transferase (PAT). The enzyme PAT confersresistance to herbicidal glutamine synthetase inhibitors such asphosphinothricin. The pat gene in p35S/Ac is under the control of the35S promoter from Cauliflower Mosaic Virus (Odell et al. (1985) Nature313:810-812) and the 3′ region of the nopaline synthase gene from theT-DNA of the Ti plasmid of Agrobacterium tumefaciens.

[0198] The particle bombardment method (Klein et al. (1987) Nature327:70-73) may be used to transfer genes to the callus culture cells.According to this method, gold particles (1 μm in diameter) are coatedwith DNA using the following technique. Ten pg of plasmid DNAs are addedto 50 μL of a suspension of gold particles (60 mg per mL). Calciumchloride (50 μL of a 2.5 M solution) and spermidine free base (20 μL ofa 1.0 M solution) are added to the particles. The suspension is vortexedduring the addition of these solutions. After 10 minutes, the tubes arebriefly centrifuged (5 sec at 15,000 rpm) and the supernatant removed.The particles are resuspended in 200 μL of absolute ethanol, centrifugedagain and the supernatant removed. The ethanol rinse is performed againand the particles resuspended in a final volume of 30 μL of ethanol. Analiquot (5 μL) of the DNA-coated gold particles can be placed in thecenter of a Kapton™ flying disc (Bio-Rad Labs). The particles are thenaccelerated into the corn tissue with a Biolistic™ PDS-1000/He (Bio-RadInstruments, Hercules Calif.), using a helium pressure of 1000 psi, agap distance of 0.5 cm and a flying distance of 1.0 cm.

[0199] For bombardment, the embryogenic tissue is placed on filter paperover agarose-solidified N6 medium. The tissue is arranged as a thin lawnand covered a circular area of about 5 cm in diameter. The petri dishcontaining the tissue can be placed in the chamber of the PDS-1000/Heapproximately 8 cm from the stopping screen. The air in the chamber isthen evacuated to a vacuum of 28 inches of mercury (Hg). Themacrocarrier is accelerated with a helium shock wave using a rupturemembrane that bursts when the He pressure in the shock tube reaches 1000psi.

[0200] Seven days after bombardment the tissue can be transferred to N6medium that contains gluphosinate (2 mg per liter) and lacks casein orproline. The tissue continues to grow slowly on this medium. After anadditional 2 weeks the tissue can be transferred to fresh N6 mediumcontaining gluphosinate. After 6 weeks, areas of about 1 cm in diameterof actively growing callus can be identified on some of the platescontaining the glufosinate-supplemented medium. These calli may continueto grow when sub-cultured on the selective medium.

[0201] Plants can be regenerated from the transgenic callus by firsttransferring clusters of tissue to N6 medium supplemented with 0.2 mgper liter of 2,4-D. After two weeks the tissue can be transferred toregeneration medium (Fromm et al. (1990) Bio/Technology 8:833-839).

Example 24 Expression of Chimeric Genes in Dicot Cells

[0202] A seed-specific construct composed of the promoter andtranscription terminator from the gene encoding the β subunit of theseed storage protein phaseolin from the bean Phaseolus vulgaris (Doyleet al. (1986) J. Biol. Chem. 261:9228-9238) can be used for expressionof the instant polypeptides in transformed soybean. The phaseolinconstruct includes about 500 nucleotides upstream (5′) from thetranslation initiation codon and about 1650 nucleotides downstream (3′)from the translation stop codon of phaseolin. Between the 5′ and 3′regions are the unique restriction endonuclease sites Nco I (whichincludes the ATG translation initiation codon), Sma I, Kpn I and Xba I.The entire construct is flanked by Hind III sites.

[0203] The cDNA fragment of this gene may be generated by polymerasechain reaction (PCR) of the cDNA clone using appropriate oligonucleotideprimers. Cloning sites can be incorporated into the oligonucleotides toprovide proper orientation of the DNA fragment when inserted into theexpression vector. Amplification is then performed as described above,and the isolated fragment is inserted into a pUC 18 vector carrying theseed construct.

[0204] Soybean embryos may then be transformed with the expressionvector comprising sequences encoding the instant polypeptides. To inducesomatic embryos, cotyledons, 3-5 mm in length dissected from surfacesterilized, immature seeds of the soybean cultivar A2872, can becultured in the light or dark at 26° C. on an appropriate agar mediumfor 6-10 weeks. Somatic embryos which produce secondary embryos are thenexcised and placed into a suitable liquid medium. After repeatedselection for clusters of somatic embryos which multiplied as early,globular staged embryos, the suspensions are maintained as describedbelow.

[0205] Soybean embryogenic suspension cultures can be maintained in 35mL of liquid media on a rotary shaker, 150 rpm, at 26° C. withflorescent lights on a 16:8 hour day/night schedule. Cultures aresubcultured every two weeks by inoculating approximately 35 mg of tissueinto 35 mL of liquid medium.

[0206] Soybean embryogenic suspension cultures may then be transformedby the method of particle gun bombardment (Klein et al. (1987) Nature(London) 327:70-73, U.S. Pat. No. 4,945,050). A DuPont Biolistic™ PDS1000/HE instrument (helium retrofit) can be used for thesetransformations.

[0207] A selectable marker gene which can be used to facilitate soybeantransformation is a chimeric gene composed of the 35S promoter fromCauliflower Mosaic Virus (Odell et al. (1985) Nature 313:810-812), thehygromycin phosphotransferase gene from plasmid pJR225 (from E. coli;Gritz et al.(1983) Gene 25:179-188) and the 3′ region of the nopalinesynthase gene from the T-DNA of the Ti plasmid of Agrobacteriumtumefaciens. The seed construct comprising the phaseolin 5′ region, thefragment encoding the instant polypeptides and the phaseolin 3′ regioncan be isolated as a restriction fragment. This fragment can then beinserted into a unique restriction site of the vector carrying themarker gene.

[0208] To 50 μL of a 60 mg/mL 1 μm gold particle suspension is added (inorder): 5 μL DNA (1 μg/μL), 20 μL spermidine (0.1 M), and 50 μL CaCl₂(2.5 M). The particle preparation is then agitated for three minutes,spun in a microfuge for 10 seconds and the supernatant removed. TheDNA-coated particles are then washed once in 400 μL 70% ethanol andresuspended in 40 μL of anhydrous ethanol. The DNA/particle suspensioncan be sonicated three times for one second each. Five μL of theDNA-coated gold particles are then loaded on each macro carrier disk.

[0209] Approximately 300-400 mg of a two-week-old suspension culture isplaced in an empty 60×15 mm petri dish and the residual liquid removedfrom the tissue with a pipette. For each transformation experiment,approximately 5-10 plates of tissue are normally bombarded. Membranerupture pressure is set at 1100 psi and the chamber is evacuated to avacuum of 28 inches of mercury (Hg). The tissue is placed approximately3.5 inches away from the retaining screen and bombarded three times.Following bombardment, the tissue can be divided in half and placed backinto liquid and cultured as described above.

[0210] Five to seven days post bombardment, the liquid media may beexchanged with fresh media, and eleven to twelve days post bombardmentwith fresh media containing 50 mg/mL hygromycin. This selective mediacan be refreshed weekly. Seven to eight weeks post bombardment, green,transformed tissue may be observed growing from untransformed, necroticembryogenic clusters. Isolated green tissue is removed and inoculatedinto individual flasks to generate new, clonally propagated, transformedembryogenic suspension cultures. Each new line may be treated as anindependent transformation event. These suspensions can then besubcultured and maintained as clusters of immature embryos orregenerated into whole plants by maturation and germination ofindividual somatic embryos.

Example 25 Expression of Chimeric Genes in Microbial Cells

[0211] The cDNAs encoding the instant polypeptides can be inserted intothe T7 E. coli expression vector pBT430. This vector is a derivative ofpET-3a (Rosenberg et al. (1987) Gene 56:125-135) which employs thebacteriophage T7 RNA polymerase/T7 promoter system. Plasmid pBT430 wasconstructed by first destroying the EcoR I and Hind III sites in pET-3aat their original positions. An oligonucleotide adaptor containing EcoRI and Hind III sites was inserted at the BamH I site of pET-3a. Thiscreated pET-3aM with additional unique cloning sites for insertion ofgenes into the expression vector. Then, the Nde I site at the positionof translation initiation was converted to an Nco I site usingoligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM inthis region, 5′-CATATGG, was converted to 5′-CCCATGG in pBT430.

[0212] Plasmid DNA containing a cDNA may be appropriately digested torelease a nucleic acid fragment encoding the protein. This fragment maythen be purified on a 1% low melting agarose gel. Buffer and agarosecontain 10 μg/mL ethidium bromide for visualization of the DNA fragment.The fragment can then be purified from the agarose gel by digestion withGELase™ (Epicentre Technologies, Madison, Wis.) according to themanufacturer's instructions, ethanol precipitated, dried and resuspendedin 20 μL of water. Appropriate oligonucleotide adapters may be ligatedto the fragment using T4 DNA ligase (New England Biolabs (NEB), Beverly,Mass.). The fragment containing the ligated adapters can be purifiedfrom the excess adapters using low melting agarose as described above.The vector pBT430 is digested, dephosphorylated with alkalinephosphatase (NEB) and deproteinized with phenol/chloroform as describedabove. The prepared vector pBT430 and fragment can then be ligated at16° C. for 15 hours followed by transformation into DH5 electrocompetentcells (GIBCO BRL). Transformants can be selected on agar platescontaining LB media and 100 μg/mL ampicillin. Transformants containingthe gene encoding the instant polypeptides are then screened for thecorrect orientation with respect to the T7 promoter by restrictionenzyme analysis.

[0213] For high level expression, a plasmid clone with the cDNA insertin the correct orientation relative to the T7 promoter can betransformed into E. coli strain BL21(DE3) (Studier et al. (1986) J. Mol.Biol. 189:113-130). Cultures are grown in LB medium containingampicillin (100 mg/L) at 25° C. At an optical density at 600 nm ofapproximately 1, IPTG (isopropylthio-β-galactoside, the inducer) can beadded to a final concentration of 0.4 mM and incubation can be continuedfor 3 h at 25° C. Cells are then harvested by centrifugation andre-suspended in 50 μL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM DTTand 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glassbeads can be added and the mixture sonicated 3 times for about 5 secondseach time with a microprobe sonicator. The mixture is centrifuged andthe protein concentration of the supernatant determined. One μg ofprotein from the soluble fraction of the culture can be separated bySDS-polyacrylamide gel electrophoresis. Gels can be observed for proteinbands migrating at the expected molecular weight.

Example 26 Evaluating Compounds for Their Ability to Inhibit theActivity of tRNA Methyltransferases or Aminoacyl-tRNA Synthetases

[0214] The polypeptides described herein may be produced using anynumber of methods known to those skilled in the art. Such methodsinclude, but are not limited to, expression in bacteria as described inExample 25, or expression in eukaryotic cell culture, in planta, andusing viral expression systems in suitably infected organisms or celllines. The instant polypeptides may be expressed either as mature formsof the proteins as observed in vivo or as fusion proteins by covalentattachment to a variety of enzymes, proteins or affinity tags. Commonfusion protein partners include glutathione S-transferase (“GST”),thioredoxin (“Trx”), maltose binding protein, and C- and/or N-terminalhexahistidine polypeptide (“(His)₆”). The fusion proteins may beengineered with a protease recognition site at the fusion point so thatfusion partners can be separated by protease digestion to yield intactmature enzyme. Examples of such proteases include thrombin, enterokinaseand factor Xa. However, any protease can be used which specificallycleaves the peptide connecting the fusion protein and the enzyme.

[0215] Purification of the instant polypeptides, if desired, may utilizeany number of separation technologies familiar to those skilled in theart of protein purification. Examples of such methods include, but arenot limited to, homogenization, filtration, centrifugation, heatdenaturation, ammonium sulfate precipitation, desalting, pHprecipitation, ion exchange chromatography, hydrophobic interactionchromatography and affinity chromatography, wherein the affinity ligandrepresents a substrate, substrate analog or inhibitor. When the instantpolypeptides are expressed as fusion proteins, the purification protocolmay include the use of an affinity resin which is specific for thefusion protein tag attached to the expressed enzyme or an affinity resincontaining ligands which are specific for the enzyme.

[0216] For example, the instant polypeptides may be expressed as afusion protein coupled to the C-terminus of thioredoxin. In addition, a(His)₆ peptide may be engineered into the N-terminus of the fusedthioredoxin moiety to afford additional opportunities for affinitypurification. Other suitable affinity resins could be synthesized bylinking the appropriate ligands to any suitable resin such asSepharose-4B. In an alternate embodiment, a thioredoxin fusion proteinmay be eluted using dithiothreitol; however, elution may be accomplishedusing other reagents which interact to displace the thioredoxin from theresin. These reagents include β-mercaptoethanol or other reduced thiol.The eluted fusion protein may be subjected to further purification bytraditional means as stated above, if desired. Proteolytic cleavage ofthe thioredoxin fusion protein and the enzyme may be accomplished afterthe fusion protein is purified or while the protein is still bound tothe ThioBond™ affinity resin or other resin.

[0217] Crude, partially purified or purified enzyme, either alone or asa fusion protein, may be utilized in assays for the evaluation ofcompounds for their ability to inhibit enzymatic activation of theinstant polypeptides disclosed herein. Assays may be conducted underwell-known experimental conditions which permit optimal enzymaticactivity. For example, detection of altered activities of the introducedtRNA-mnm⁵s²U-MT would be performed in bacterial deletion backgrounds.The methods could be similar to, but not limited to, those presented inElseviers et al. (1984) Nucleic Acids Res. 12:3521-3534 or Hagervall andBjork (1984) Mol. Gen. Genet. 196:194-200. Assays for aminoacyl t-RNAsynthetases are presented by Lloyd et al. (1995) Nucleic Acids Res.23:2886-2892.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 196 <210> SEQ ID NO 1<211> LENGTH: 688 <212> TYPE: DNA <213> ORGANISM: Zea mays <220>FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (49) <221> NAME/KEY:unsure <222> LOCATION: (78) <221> NAME/KEY: unsure <222> LOCATION: (125)<221> NAME/KEY: unsure <222> LOCATION: (137) <221> NAME/KEY: unsure<222> LOCATION: (163) <221> NAME/KEY: unsure <222> LOCATION: (191) <221>NAME/KEY: unsure <222> LOCATION: (194) <221> NAME/KEY: unsure <222>LOCATION: (226) <221> NAME/KEY: unsure <222> LOCATION: (296) <221>NAME/KEY: unsure <222> LOCATION: (379) <221> NAME/KEY: unsure <222>LOCATION: (384)..(385) <221> NAME/KEY: unsure <222> LOCATION: (389)<221> NAME/KEY: unsure <222> LOCATION: (395) <221> NAME/KEY: unsure<222> LOCATION: (416) <221> NAME/KEY: unsure <222> LOCATION: (424) <221>NAME/KEY: unsure <222> LOCATION: (428) <221> NAME/KEY: unsure <222>LOCATION: (446) <221> NAME/KEY: unsure <222> LOCATION: (461) <221>NAME/KEY: unsure <222> LOCATION: (473) <221> NAME/KEY: unsure <222>LOCATION: (500)..(501) <221> NAME/KEY: unsure <222> LOCATION: (522)<221> NAME/KEY: unsure <222> LOCATION: (527) <221> NAME/KEY: unsure<222> LOCATION: (533)..(534) <221> NAME/KEY: unsure <222> LOCATION:(540) <221> NAME/KEY: unsure <222> LOCATION: (549) <221> NAME/KEY:unsure <222> LOCATION: (554) <221> NAME/KEY: unsure <222> LOCATION:(563) <221> NAME/KEY: unsure <222> LOCATION: (565) <221> NAME/KEY:unsure <222> LOCATION: (572) <221> NAME/KEY: unsure <222> LOCATION:(580) <221> NAME/KEY: unsure <222> LOCATION: (586) <221> NAME/KEY:unsure <222> LOCATION: (589) <221> NAME/KEY: unsure <222> LOCATION:(594)..(595) <221> NAME/KEY: unsure <222> LOCATION: (606) <221>NAME/KEY: unsure <222> LOCATION: (611) <221> NAME/KEY: unsure <222>LOCATION: (614) <221> NAME/KEY: unsure <222> LOCATION: (623)..(624)<221> NAME/KEY: unsure <222> LOCATION: (627)..(628) <221> NAME/KEY:unsure <222> LOCATION: (630)..(631) <221> NAME/KEY: unsure <222>LOCATION: (641) <221> NAME/KEY: unsure <222> LOCATION: (667) <221>NAME/KEY: unsure <222> LOCATION: (669) <221> NAME/KEY: unsure <222>LOCATION: (673) <221> NAME/KEY: unsure <222> LOCATION: (680) <221>NAME/KEY: unsure <222> LOCATION: (688) <400> SEQUENCE: 1 ggatgtacttaaccacatcc agcttggagg atctgttgca ggcacgganc ctgagggcag 60 tggcaaggccaagaacangc cattcatgac aaagacggcc tacttctact gggtgaccag 120 ggaanaggggtcctttnaat ggttccgagg ggtcatgaat gangtggctg agaaggacaa 180 ggatggagtcnttnaactcc acaaccactg ctcgagtgtg caccangaag gggatgttcg 240 ttctgccctcattgtcatgc tccacgagct ccagcacgcg aagaaaggag tcgacntctt 300 gtctggaactagtgtcaaca cgcattttgc acggccgaat tggcgaatcc tcttccaaca 360 tgttgcactgaaccacgana accnncgcnt ccganttttc tactgtggtg accccntcct 420 tgtnccanagctaccgcacc tgtcancgga ctccccctaa nacaaatact acnttccagt 480 tccacacaggacaacttctn nttggaagaa ctggaaaaaa anctctncgc gtnncctctn 540 atggtttcncaatntgtcaa ttncnattgt tnttaccctn tttccnaant tcannaatcc 600 cacganggganttngttttc gcnnttnnan naaaaaaaaa nacgggccgc ccgctctcaa 660 aggatcnancctncctttcn ccttcctn 688 <210> SEQ ID NO 2 <211> LENGTH: 134 <212> TYPE:PRT <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: UNSURE <222>LOCATION: (10) <221> NAME/KEY: UNSURE <222> LOCATION: (26) <221>NAME/KEY: UNSURE <222> LOCATION: (30) <221> NAME/KEY: UNSURE <222>LOCATION: (38) <221> NAME/KEY: UNSURE <222> LOCATION: (48)..(49) <221>NAME/KEY: UNSURE <222> LOCATION: (59) <221> NAME/KEY: UNSURE <222>LOCATION: (83) <221> NAME/KEY: UNSURE <222> LOCATION: (110) <221>NAME/KEY: UNSURE <222> LOCATION: (112) <221> NAME/KEY: UNSURE <222>LOCATION: (114) <221> NAME/KEY: UNSURE <222> LOCATION: (116) <221>NAME/KEY: UNSURE <222> LOCATION: (123) <221> NAME/KEY: UNSURE <222>LOCATION: (127) <221> NAME/KEY: UNSURE <222> LOCATION: (133) <400>SEQUENCE: 2 Pro Glu Gly Ser Gly Lys Ala Lys Asn Xaa Pro Phe Met Thr LysThr 1 5 10 15 Ala Tyr Phe Tyr Trp Val Thr Arg Glu Xaa Gly Ser Phe XaaTrp Phe 20 25 30 Arg Gly Val Met Asn Xaa Val Ala Glu Lys Asp Lys Asp GlyVal Xaa 35 40 45 Xaa Leu His Asn His Cys Ser Ser Val His Xaa Glu Gly AspVal Arg 50 55 60 Ser Ala Leu Ile Val Met Leu His Glu Leu Gln His Ala LysLys Gly 65 70 75 80 Val Asp Xaa Leu Ser Gly Thr Ser Val Asn Thr His PheAla Arg Pro 85 90 95 Asn Trp Arg Ile Leu Phe Gln His Val Ala Leu Asn HisXaa Asn Xaa 100 105 110 Arg Xaa Arg Xaa Phe Tyr Cys Gly Asp Pro Xaa LeuVal Pro Xaa Leu 115 120 125 Pro His Leu Ser Xaa Asp 130 <210> SEQ ID NO3 <211> LENGTH: 562 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220>FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (535) <221> NAME/KEY:unsure <222> LOCATION: (545) <400> SEQUENCE: 3 gtttaaacgc ggcgcgcctacttctactgg gtgacgcggg agcaggggtc gttcgagtgg 60 ttccgggggg tgatggacgaggtggcggag acggacaaga agggggtgat cgagctgcac 120 aactactgca ccagcgtgtacgaggaaggg gacgcccggt cggcgctcat cgctatgctc 180 cagtcgctca accacgccaagcacggcgtc gacgtcgtct ccggcacccg cgtcaagacc 240 cacttcgccc gccccaactggcgcaacgtc tacaagcgca tcgccctcaa ccaccgcgac 300 caacgcgtcg gggtgttctactgtggcgcg ccggtgctga cgaaggaact gcgtgagtcg 360 ctcaagattt ctcgagaaagacgagcacga aattcgactt ccacaaggag aatttctagt 420 tatctggaat caaaaccaaagttttcgcac ggccatgttt agtacataca aagttctata 480 catatgacaa gtatgatgacatacatattg gaatgtagag ggattagatc aaagnaggta 540 ttgcntgatt gtgggccagg cg562 <210> SEQ ID NO 4 <211> LENGTH: 116 <212> TYPE: PRT <213> ORGANISM:Oryza sativa <400> SEQUENCE: 4 Thr Arg Arg Ala Tyr Phe Tyr Trp Val ThrArg Glu Gln Gly Ser Phe 1 5 10 15 Glu Trp Phe Arg Gly Val Met Asp GluVal Ala Glu Thr Asp Lys Lys 20 25 30 Gly Val Ile Glu Leu His Asn Tyr CysThr Ser Val Tyr Glu Glu Gly 35 40 45 Asp Ala Arg Ser Ala Leu Ile Ala MetLeu Gln Ser Leu Asn His Ala 50 55 60 Lys His Gly Val Asp Val Val Ser GlyThr Arg Val Lys Thr His Phe 65 70 75 80 Ala Arg Pro Asn Trp Arg Asn ValTyr Lys Arg Ile Ala Leu Asn His 85 90 95 Arg Asp Gln Arg Val Gly Val PheTyr Cys Gly Ala Pro Val Leu Thr 100 105 110 Lys Glu Leu Arg 115 <210>SEQ ID NO 5 <211> LENGTH: 489 <212> TYPE: DNA <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (234)<221> NAME/KEY: unsure <222> LOCATION: (310) <221> NAME/KEY: unsure<222> LOCATION: (405) <221> NAME/KEY: unsure <222> LOCATION:(417)..(418) <221> NAME/KEY: unsure <222> LOCATION: (424) <221>NAME/KEY: unsure <222> LOCATION: (433) <221> NAME/KEY: unsure <222>LOCATION: (454) <221> NAME/KEY: unsure <222> LOCATION: (464)..(465)<221> NAME/KEY: unsure <222> LOCATION: (476) <221> NAME/KEY: unsure<222> LOCATION: (489) <400> SEQUENCE: 5 cttcgagtgg ttccgcggcg tgatggacgaggtggcggag acggacagga agggcgtcat 60 cgagctgcac aactactgca ccagcgtgtacgaggaaggg gacgcccggt ccgcgctcat 120 cgccatgctc cagtcgctca accacgccaagcacggcgtc gacgtggtgt ccggcacccg 180 cgtcaagacc cacttcgcca ggcctaactggcgcaacgtc tacaagcgca tcgntcaacc 240 accagaacca gcgcgtcgga gtgttctactgcggcgcccc ggtgctgacc aaggagctgc 300 gcaactggcn aggacttctc gcggaagaccaacaccaagt tcgagttcca caaggagaac 360 tttagtctct ctctctctat atatatgtaaggtatgaatg gttanaaagg gatcgannac 420 caantaggat ttncttaaca attggttgggggtnaaaccc aaannttttt tttaanatat 480 atatctttn 489 <210> SEQ ID NO 6<211> LENGTH: 122 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum<220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (78)..(79) <221>NAME/KEY: UNSURE <222> LOCATION: (103)..(104)..(105) <400> SEQUENCE: 6Phe Glu Trp Phe Arg Gly Val Met Asp Glu Val Ala Glu Thr Asp Arg 1 5 1015 Lys Gly Val Ile Glu Leu His Asn Tyr Cys Thr Ser Val Tyr Glu Glu 20 2530 Gly Asp Ala Arg Ser Ala Leu Ile Ala Met Leu Gln Ser Leu Asn His 35 4045 Ala Lys His Gly Val Asp Val Val Ser Gly Thr Arg Val Lys Thr His 50 5560 Phe Ala Arg Pro Asn Trp Arg Asn Val Tyr Lys Arg Ile Xaa Xaa Asn 65 7075 80 His Gln Asn Gln Arg Val Gly Val Phe Tyr Cys Gly Ala Pro Val Leu 8590 95 Thr Lys Glu Leu Arg Asn Xaa Xaa Xaa Asp Phe Ser Arg Lys Thr Asn100 105 110 Thr Lys Phe Glu Phe His Lys Glu Asn Phe 115 120 <210> SEQ IDNO 7 <211> LENGTH: 648 <212> TYPE: DNA <213> ORGANISM: Zea mays <400>SEQUENCE: 7 ccacgcgtcc gggatgtact taaccacatc cagcttggag gatctgttgcaggcacggag 60 cctgagggca gtggcaaggc caagaagagg ccattcatga caaagagggcctacttctac 120 tgggtgacca gggaagaggg gtcctttgaa tggttccgag gggtcatgaatgaggtggct 180 gagaaggaca aggatggagt cattgaactc cacaaccact gctcgagtgtgcaccaggaa 240 ggggatgtac gttctgccct cattgtcatg ctccaggagc tccagcacgcgaagaaggga 300 gtcgacatct tgtctggaac tagtgtcaag acgcattttg cacggccgaattggcgaagc 360 gtcttcaaac atgttgcagt gaaccacgag aaccaacgca tcggagttttctactgtggt 420 gagcccgtcc ttgtgccaca gctacggcag ctgtcagcgg acttcacccacaagacaaat 480 actaagttcg agttccacaa ggagaacttc taatggagga actggagaaaagctctaggc 540 gtggccctct aatggtttca gaatatgtga attacgatgg ttgtaacatatatacgagta 600 gaagaataag acgatgggag ttgttttgtt gtaaaaaaaa aaaaaaag 648<210> SEQ ID NO 8 <211> LENGTH: 170 <212> TYPE: PRT <213> ORGANISM: Zeamays <400> SEQUENCE: 8 Pro Arg Val Arg Asp Val Leu Asn His Ile Gln LeuGly Gly Ser Val 1 5 10 15 Ala Gly Thr Glu Pro Glu Gly Ser Gly Lys AlaLys Lys Arg Pro Phe 20 25 30 Met Thr Lys Arg Ala Tyr Phe Tyr Trp Val ThrArg Glu Glu Gly Ser 35 40 45 Phe Glu Trp Phe Arg Gly Val Met Asn Glu ValAla Glu Lys Asp Lys 50 55 60 Asp Gly Val Ile Glu Leu His Asn His Cys SerSer Val His Gln Glu 65 70 75 80 Gly Asp Val Arg Ser Ala Leu Ile Val MetLeu Gln Glu Leu Gln His 85 90 95 Ala Lys Lys Gly Val Asp Ile Leu Ser GlyThr Ser Val Lys Thr His 100 105 110 Phe Ala Arg Pro Asn Trp Arg Ser ValPhe Lys His Val Ala Val Asn 115 120 125 His Glu Asn Gln Arg Ile Gly ValPhe Tyr Cys Gly Glu Pro Val Leu 130 135 140 Val Pro Gln Leu Arg Gln LeuSer Ala Asp Phe Thr His Lys Thr Asn 145 150 155 160 Thr Lys Phe Glu PheHis Lys Glu Asn Phe 165 170 <210> SEQ ID NO 9 <211> LENGTH: 1200 <212>TYPE: DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 9 cacgaggtttaaacgcggcg cgcctacttc tactgggtga cgcgggagca ggggtcgttc 60 gagtggttccggggggtgat ggacgaggtg gcggagacgg acaagaaggg ggtgatcgag 120 ctgcacaactactgcaccag cgtgtacgag gaaggggacg cccggtcggc gctcatcgct 180 atgctccagtcgctcaacca cgccaagcac ggcgtcgacg tcgtctccgg cacccgcgtc 240 aagacccacttcgcccgccc caactggcgc aacgtctaca agcgcatcgc cctcaaccac 300 cgcgaccaacgcgtcggggt gttctactgt ggcgcgccgg tgctgacgaa ggaactgcgt 360 gagctcgctcaagatttctc gagaaagacg agcacgaaat tcgacttcca caaggagaat 420 ttctagttatctggaatcaa aaccaaagtt ttcgcacggc catgtttagt acatacaaag 480 ttctatacatatgacaagta tgatgacata catattggaa atgtagaggg attagatcaa 540 agtaggtattgcttgattgt ggccaggctt ggccagataa tttcatcggt tttttgctct 600 ggaagaataatccaatgccc ccctttgtac agatcttctc ccagataata ctttgtaata 660 cttagagtagccaatttgat aaaatcagtt tgtatctagt aacatgtaga gagtttcatg 720 gaggcctaatcaggtcaaaa atatcacaaa tgtttggcca agaacaagaa aaatcatgcc 780 cattgcaagaaagaaattgc actttttgtg gctggtactg aattgcggac ctgaagaaca 840 cacctgatattggagcactg cacatggtgg tgaccaacaa acaagaacaa cctcatgtcc 900 tatgtccacaaccaatgagc actgccaaca tattagggtt ctagaagatg atgagcatga 960 tggtaccctcctggcccaca tagcaactag agtagttgtc tttggttttt actttttgga 1020 atatctgttctttctgttcc ttcaatttta ttccattgta ttctgtgaag tgcatgataa 1080 agaggggattacgcaaaata atactttttg caattcatgg ctgtaaatgc actgttgaac 1140 aataaaaatctgagcttttt ttttttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1200 <210> SEQ IDNO 10 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Oryza sativa<400> SEQUENCE: 10 Thr Arg Arg Ala Tyr Phe Tyr Trp Val Thr Arg Glu GlnGly Ser Phe 1 5 10 15 Glu Trp Phe Arg Gly Val Met Asp Glu Val Ala GluThr Asp Lys Lys 20 25 30 Gly Val Ile Glu Leu His Asn Tyr Cys Thr Ser ValTyr Glu Glu Gly 35 40 45 Asp Ala Arg Ser Ala Leu Ile Ala Met Leu Gln SerLeu Asn His Ala 50 55 60 Lys His Gly Val Asp Val Val Ser Gly Thr Arg ValLys Thr His Phe 65 70 75 80 Ala Arg Pro Asn Trp Arg Asn Val Tyr Lys ArgIle Ala Leu Asn His 85 90 95 Arg Asp Gln Arg Val Gly Val Phe Tyr Cys GlyAla Pro Val Leu Thr 100 105 110 Lys Glu Leu Arg Glu Leu Ala Gln Asp PheSer Arg Lys Thr Ser Thr 115 120 125 Lys Phe Asp Phe His Lys Glu Asn Phe130 135 <210> SEQ ID NO 11 <211> LENGTH: 543 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 11 gcacgagctt cgagtggttccgcggcgtga tggacgaggt ggcggagacg gacaggaagg 60 gcgtcatcga gctgcacaactactgcacca gcgtgtacga ggaaggggac gcccggtccg 120 cgctcatcgc catgctccagtcgctcaacc acgccaagca cggcgtcgac gtggtgtccg 180 gcacccgcgt caagacccacttcgccaggc ctaactggcg caacgtctac aagcgcatcg 240 cgctcaacca ccagaaccagcgcgtcggag tgttctactg cggcgccccg gtgctgacca 300 aggagctgcg cgagctggcgcaggacttct cgcggaagac caacaccaag ttcgagttcc 360 acaaggagaa cttctagctctctctctctc tatatatatg taaggatatg gatgggtgta 420 gagaggggat cggagtagcgcaaagtaggt atttgcttga ccagttgggt tgggtgggtc 480 aaaaccccaa cagtttttttttttaattat tattattatt cttcttaaaa aaaaaaaaaa 540 aaa 543 <210> SEQ ID NO12 <211> LENGTH: 124 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum<400> SEQUENCE: 12 Thr Ser Phe Glu Trp Phe Arg Gly Val Met Asp Glu ValAla Glu Thr 1 5 10 15 Asp Arg Lys Gly Val Ile Glu Leu His Asn Tyr CysThr Ser Val Tyr 20 25 30 Glu Glu Gly Asp Ala Arg Ser Ala Leu Ile Ala MetLeu Gln Ser Leu 35 40 45 Asn His Ala Lys His Gly Val Asp Val Val Ser GlyThr Arg Val Lys 50 55 60 Thr His Phe Ala Arg Pro Asn Trp Arg Asn Val TyrLys Arg Ile Ala 65 70 75 80 Leu Asn His Gln Asn Gln Arg Val Gly Val PheTyr Cys Gly Ala Pro 85 90 95 Val Leu Thr Lys Glu Leu Arg Glu Leu Ala GlnAsp Phe Ser Arg Lys 100 105 110 Thr Asn Thr Lys Phe Glu Phe His Lys GluAsn Phe 115 120 <210> SEQ ID NO 13 <211> LENGTH: 647 <212> TYPE: DNA<213> ORGANISM: Zea mays <400> SEQUENCE: 13 gtgtggtgaa gctgccatcgccgctacacc accttgccgg tttcaatgcc ttctggtacg 60 ctcaccacct gctggtccttgcgtatgtcc tgctggtggt gcactcctac ttcatattcc 120 tcaccaggga gtggtacaagaagacgacat ggatgtacct gattgtccct gtcctcttct 180 atgcctgtga aagagtcatcaggaaatttc gtgagaacaa ctaccatgcg ggaattgtga 240 gggcagcaat ttatccgggagatgtgctct ctattcacat gaagaagcca cagggtttca 300 agtacaagag tgggatgtatctgtttgtta aatgcccaga agtctcgccc ttcgagtggc 360 accccttctc tataacttcggcaccaggcg atgactactt gagtgtgcat atccgtacgc 420 tgggtgactg gacatccgaactgcggatgc tttttgggaa ggcttgccag gcacaagtta 480 acttccaaga aaggctacccttacaagact tgaaactaca gttgtggcag acgcccagac 540 agaggacact aggtttccccaaggtctaca tagacgggcc atacggtgca ccagcacaaa 600 attacaggaa atatgacattcttctgctta ttggccttgg aatagga 647 <210> SEQ ID NO 14 <211> LENGTH: 167<212> TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 14 Leu Pro SerPro Leu His His Leu Ala Gly Phe Asn Ala Phe Trp Tyr 1 5 10 15 Ala HisHis Leu Leu Val Leu Ala Tyr Val Leu Leu Val Val His Ser 20 25 30 Tyr PheIle Phe Leu Thr Arg Glu Trp Tyr Lys Lys Thr Thr Trp Met 35 40 45 Tyr LeuIle Val Pro Val Leu Phe Tyr Ala Cys Glu Arg Val Ile Arg 50 55 60 Lys PheArg Glu Asn Asn Tyr His Ala Gly Ile Val Arg Ala Ala Ile 65 70 75 80 TyrPro Gly Asp Val Leu Ser Ile His Met Lys Lys Pro Gln Gly Phe 85 90 95 LysTyr Lys Ser Gly Met Tyr Leu Phe Val Lys Cys Pro Glu Val Ser 100 105 110Pro Phe Glu Trp His Pro Phe Ser Ile Thr Ser Ala Pro Gly Asp Asp 115 120125 Tyr Leu Ser Val His Ile Arg Thr Leu Gly Asp Trp Thr Ser Glu Leu 130135 140 Arg Met Leu Phe Gly Lys Ala Cys Gln Ala Gln Val Asn Phe Gln Glu145 150 155 160 Arg Leu Pro Leu Gln Asp Leu 165 <210> SEQ ID NO 15 <211>LENGTH: 577 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220> FEATURE:<221> NAME/KEY: unsure <222> LOCATION: (340) <221> NAME/KEY: unsure<222> LOCATION: (427) <221> NAME/KEY: unsure <222> LOCATION:(496)..(497) <221> NAME/KEY: unsure <222> LOCATION: (507) <221>NAME/KEY: unsure <222> LOCATION: (517) <221> NAME/KEY: unsure <222>LOCATION: (519)..(520) <221> NAME/KEY: unsure <222> LOCATION: (533)<221> NAME/KEY: unsure <222> LOCATION: (537) <221> NAME/KEY: unsure<222> LOCATION: (541) <221> NAME/KEY: unsure <222> LOCATION: (549) <221>NAME/KEY: unsure <222> LOCATION: (552) <221> NAME/KEY: unsure <222>LOCATION: (568) <400> SEQUENCE: 15 gttgggattc agaaggttgc agtgtatcccgggaatgtat tggctcttta tatgtcgaag 60 ccacctggtt tcagataccg tagtgggcagtacatcttca taaaatgcac tgctgtgtct 120 ccatatgaat ggcatccatt ttccataacatcagcacctg gagatgatta tcttagtgtt 180 catattcgca caaggggtga ttgggcttcacggcttaaga actgttttct cctgaggcat 240 gccgaccccc caactgaggg agaaaatgggactacttaga ctgacctttc ccaaagggaa 300 taacggacga aaacccaaga tccccaaaacttttgggccn atggaccgtt atggtgcacc 360 ggcacaagat taccgtgaaa tacgatgtgctacttctcaa ccgggctggg aaccggacca 420 ccctttngat tacattgtga agactgctaacacatcaagg tgaggataat tggacacgga 480 ccggagacac acaagnnaag aagaacnttatgacaanann ccactcacgg ggnacanaag 540 ngcctttant gntaagggta taacagtntcaaagcag 577 <210> SEQ ID NO 16 <211> LENGTH: 76 <212> TYPE: PRT <213>ORGANISM: Oryza sativa <400> SEQUENCE: 16 Val Gly Ile Gln Lys Val AlaVal Tyr Pro Gly Asn Val Leu Ala Leu 1 5 10 15 Tyr Met Ser Lys Pro ProGly Phe Arg Tyr Arg Ser Gly Gln Tyr Ile 20 25 30 Phe Ile Lys Cys Thr AlaVal Ser Pro Tyr Glu Trp His Pro Phe Ser 35 40 45 Ile Thr Ser Ala Pro GlyAsp Asp Tyr Leu Ser Val His Ile Arg Thr 50 55 60 Arg Gly Asp Trp Ala SerArg Leu Lys Asn Cys Phe 65 70 75 <210> SEQ ID NO 17 <211> LENGTH: 457<212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221>NAME/KEY: unsure <222> LOCATION: (336) <400> SEQUENCE: 17 gcatttccaagttgtataac tcaatgtatc caaggttgtc cggatccagt tcctccatga 60 taagggcagcatattcctct gcacgatctt ttaattttga cagcttattg gctgaagcgc 120 ttaaggtgataatctctttt acttcttctt cattaattcg tccatcggca tctttgtcca 180 ccatgtcaaaaaaggtctga agccgtgaat caaaactctg gtctgtaatt tgctcccaaa 240 attcacgcaactgatccttc gttatggaag ctgatgttat ccctcgacga cgagctaatg 300 catcgaataactcaccggca aactccttcg attcgntcat ccctatgcac tggctaaaag 360 cgaagtcttggggagctttg catcaatggc caactcatcc gaagcgcttc tcaacctgtg 420 accaacctttagtggctgct ttgggcaaga acttaag 457 <210> SEQ ID NO 18 <211> LENGTH: 144<212> TYPE: PRT <213> ORGANISM: Glycine max <220> FEATURE: <221>NAME/KEY: UNSURE <222> LOCATION: (26) <221> NAME/KEY: UNSURE <222>LOCATION: (34) <400> SEQUENCE: 18 Ala Ala Thr Lys Gly Trp Ser Gln ValGlu Lys Arg Phe Asp Glu Leu 1 5 10 15 Ala Ile Asp Ala Lys Leu Pro LysThr Xaa Phe Ser Gln Cys Ile Gly 20 25 30 Met Xaa Glu Ser Lys Glu Phe AlaGly Glu Leu Phe Asp Ala Leu Ala 35 40 45 Arg Arg Arg Gly Ile Thr Ser AlaSer Ile Thr Lys Asp Gln Leu Arg 50 55 60 Glu Phe Trp Glu Gln Ile Thr AspGln Ser Phe Asp Ser Arg Leu Gln 65 70 75 80 Thr Phe Phe Asp Met Val AspLys Asp Ala Asp Gly Arg Ile Asn Glu 85 90 95 Glu Glu Val Lys Glu Ile IleThr Leu Ser Ala Ser Ala Asn Lys Leu 100 105 110 Ser Lys Leu Lys Asp ArgAla Glu Glu Tyr Ala Ala Leu Ile Met Glu 115 120 125 Glu Leu Asp Pro AspAsn Leu Gly Tyr Ile Glu Leu Tyr Asn Leu Glu 130 135 140 <210> SEQ ID NO19 <211> LENGTH: 577 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum<220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (214) <221>NAME/KEY: unsure <222> LOCATION: (228) <221> NAME/KEY: unsure <222>LOCATION: (282) <221> NAME/KEY: unsure <222> LOCATION: (370) <221>NAME/KEY: unsure <222> LOCATION: (372) <221> NAME/KEY: unsure <222>LOCATION: (412) <221> NAME/KEY: unsure <222> LOCATION: (429) <221>NAME/KEY: unsure <222> LOCATION: (450) <221> NAME/KEY: unsure <222>LOCATION: (462) <221> NAME/KEY: unsure <222> LOCATION: (476) <221>NAME/KEY: unsure <222> LOCATION: (482) <221> NAME/KEY: unsure <222>LOCATION: (488) <221> NAME/KEY: unsure <222> LOCATION: (510) <221>NAME/KEY: unsure <222> LOCATION: (514) <221> NAME/KEY: unsure <222>LOCATION: (519) <221> NAME/KEY: unsure <222> LOCATION: (526) <221>NAME/KEY: unsure <222> LOCATION: (538) <221> NAME/KEY: unsure <222>LOCATION: (542) <221> NAME/KEY: unsure <222> LOCATION: (545) <400>SEQUENCE: 19 gaatacgatg tgctcctgct cattggactg ggcattggag ccactccattgattagcatt 60 gtgaaggatg tgcttaacca cacccagcat ggaggatctg tttcaggcacggagcctgag 120 ggcagtggca aggccaagaa gaggccattc atgacgaaga gggcctacttctactgggtg 180 accagagaag agggatcttt cgaatggttc cgangggtca tgaacgangtgggctgagaa 240 aggacaagga tgggagtcat tgaactccac aacaactgct cnattgtgtacaaggaaggg 300 atgcacgttc tgcactccat tgtcatgctc caagactcca acaatgcaaagaaaggggtc 360 gacatcttgn cnggaactaa tgtcaagacg cacttcgcgc gtcccattggcnaacgtctc 420 caacatgtng catgaaccat gagaacaacn ctttgggttt cnacggggtaacccgncttt 480 tncaaagntc ggaatggcac aaattccccn aagncaacna atttgntccaagggaatnca 540 anganacgga aaccgggcgg gcccaaggtt aaatgat 577 <210> SEQ IDNO 20 <211> LENGTH: 134 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (71)<221> NAME/KEY: UNSURE <222> LOCATION: (75) <221> NAME/KEY: UNSURE <222>LOCATION: (78) <221> NAME/KEY: UNSURE <222> LOCATION: (123) <400>SEQUENCE: 20 Tyr Asp Val Leu Leu Leu Ile Gly Leu Gly Ile Gly Ala Thr ProLeu 1 5 10 15 Ile Ser Ile Val Lys Asp Val Leu Asn His Thr Gln His GlyGly Ser 20 25 30 Val Ser Gly Thr Glu Pro Glu Gly Ser Gly Lys Ala Lys LysArg Pro 35 40 45 Phe Met Thr Lys Arg Ala Tyr Phe Tyr Trp Val Thr Arg GluGlu Gly 50 55 60 Ser Phe Glu Trp Phe Arg Xaa Val Met Asn Xaa Val Gly XaaGlu Arg 65 70 75 80 Thr Arg Met Gly Val Ile Glu Leu His Asn Asn Cys SerIle Val Tyr 85 90 95 Lys Glu Gly Met His Val Leu His Ser Ile Val Met LeuGln Asp Ser 100 105 110 Asn Asn Ala Lys Lys Gly Val Asp Ile Leu Xaa GlyThr Asn Val Lys 115 120 125 Thr His Phe Ala Arg Pro 130 <210> SEQ ID NO21 <211> LENGTH: 1257 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <400>SEQUENCE: 21 gcacgaggtt gggattcaga aggttgcagt gtatcccggg aatgtattggctctttatat 60 gtcgaagcca cctggtttca gataccgtag tgggcagtac atcttcataaaatgcactgc 120 tgtgtctcca tatgaatggc atccattttc cataacatca gcacctggagatgattatct 180 tagtgttcat attcgcacaa ggggtgattg gacttcacgg cttagaactgttttctctga 240 ggcatgccga ccccccactg agggagaaag tggactactt agagctgacctttccaaggg 300 aataacggac gaaaacgcaa gattcccaaa acttttggtc gatggaccgtatggtgcacc 360 ggcacaagat taccgtgaat acgatgtgct acttctcatc gggctgggcatcggagccac 420 ccctttgatt agcattgtga aggacgtgct taaccacatt caaggtgagggatcagttgg 480 aaccacggag ccggagagca gcagcaaggc gaagaagaaa cctttcatgacgaagagagc 540 ctacttctac tgggtgacga gagaggaggg ctcgtttgag tggttcagaggcgtcatgaa 600 cgaggtgtct gagaaggaca aggatggagt cattgagctc cataaccactgctcaagcgt 660 gtaccaggaa ggcgatgctc gttctgctct cattgtcatg ctccaagaacttcagcatgc 720 gaagaagggc gtcgatatct tgtcgggaac tagtgtgaag acccatttcgcacgacctaa 780 ttggcgaagc gtcttcaaga aggttgcggt caaccatgag aaccagcgcgtcggtgtgtt 840 ctactgtggt gagcctgtgc tggttcccca actaaggcag ttgtcagcagatttcaccca 900 caagacaaac acaagatttg atttccacaa ggagaacttc taatggtacaaattgagaaa 960 tacccgtgta tggttttgta tgtagttctt tatcatgtga attatatggtttctaatata 1020 tataaagttg gacaaaataa atgaaatgat ggaagctatt ttgttttttaagatgtcaaa 1080 agtctgcaat atctttttac aagagtgctg tctattcatg tatacacctagtggaagaag 1140 ctgtgcttca tgttgtagct tacatagatg aagggaagtt ctctttgttgtgaccaaagg 1200 atgcctagaa tcatgtaaca ttgtgatgtt ccctttcaaa aaaaaaaaaaaaaaaaa 1257 <210> SEQ ID NO 22 <211> LENGTH: 313 <212> TYPE: PRT <213>ORGANISM: Oryza sativa <400> SEQUENCE: 22 His Glu Val Gly Ile Gln LysVal Ala Val Tyr Pro Gly Asn Val Leu 1 5 10 15 Ala Leu Tyr Met Ser LysPro Pro Gly Phe Arg Tyr Arg Ser Gly Gln 20 25 30 Tyr Ile Phe Ile Lys CysThr Ala Val Ser Pro Tyr Glu Trp His Pro 35 40 45 Phe Ser Ile Thr Ser AlaPro Gly Asp Asp Tyr Leu Ser Val His Ile 50 55 60 Arg Thr Arg Gly Asp TrpThr Ser Arg Leu Arg Thr Val Phe Ser Glu 65 70 75 80 Ala Cys Arg Pro ProThr Glu Gly Glu Ser Gly Leu Leu Arg Ala Asp 85 90 95 Leu Ser Lys Gly IleThr Asp Glu Asn Ala Arg Phe Pro Lys Leu Leu 100 105 110 Val Asp Gly ProTyr Gly Ala Pro Ala Gln Asp Tyr Arg Glu Tyr Asp 115 120 125 Val Leu LeuLeu Ile Gly Leu Gly Ile Gly Ala Thr Pro Leu Ile Ser 130 135 140 Ile ValLys Asp Val Leu Asn His Ile Gln Gly Glu Gly Ser Val Gly 145 150 155 160Thr Thr Glu Pro Glu Ser Ser Ser Lys Ala Lys Lys Lys Pro Phe Met 165 170175 Thr Lys Arg Ala Tyr Phe Tyr Trp Val Thr Arg Glu Glu Gly Ser Phe 180185 190 Glu Trp Phe Arg Gly Val Met Asn Glu Val Ser Glu Lys Asp Lys Asp195 200 205 Gly Val Ile Glu Leu His Asn His Cys Ser Ser Val Tyr Gln GluGly 210 215 220 Asp Ala Arg Ser Ala Leu Ile Val Met Leu Gln Glu Leu GlnHis Ala 225 230 235 240 Lys Lys Gly Val Asp Ile Leu Ser Gly Thr Ser ValLys Thr His Phe 245 250 255 Ala Arg Pro Asn Trp Arg Ser Val Phe Lys LysVal Ala Val Asn His 260 265 270 Glu Asn Gln Arg Val Gly Val Phe Tyr CysGly Glu Pro Val Leu Val 275 280 285 Pro Gln Leu Arg Gln Leu Ser Ala AspPhe Thr His Lys Thr Asn Thr 290 295 300 Arg Phe Asp Phe His Lys Glu AsnPhe 305 310 <210> SEQ ID NO 23 <211> LENGTH: 918 <212> TYPE: DNA <213>ORGANISM: Glycine max <400> SEQUENCE: 23 tcgatcataa attaattacaaaaataacat aggttgtcgc aacatggaga ttcacgaaaa 60 ccaacacgag tcatggtcggaaacggagag cacgggaagc cggagcagga gagtgggctt 120 cagtgggcct ctgagcggaccactgagcgg gcctttgagt gggcctttgg tttcttctaa 180 caaaagaaac agcagcaagaacaaaagtgc gaggttcaag gacgacgagg agatggtgga 240 gatcacgctg gacgtccgcgacgacgccgt ttcggtccag aacatccgcg gcggcgactc 300 cgagacggcg ttcctcgccagccgcctcga gatgaggccg tcgtcgtttt ccgatcggct 360 gagacaggtg tcgcgggaactgaagcgcat gacatcaaac aaggccttcg atagggttga 420 ccgcagcaaa tccggtgctgcgcgcgccct tggtggtctt aagttcatga ccaaagcagg 480 cactgaaggt tggtcacaggttgagaagcg cttcgatgag ttggccattg atgcaaagct 540 ccccaagact cgctttagccagtgcatagg gatgaacgaa tcgaaggagt ttgccggtga 600 gttattcgat gcattagctcgtcgtcgagg gataacatca gcttccataa cgaaggatca 660 gttgcgtgaa ttttgggagcaaattacaga ccagagtttt gattcacggc ttcagacctt 720 ttttgacatg gtggacaaagatgccgatgg acgaattaat gaagaagaag taaaagagat 780 tatcacctta agcgcttcagccaataagct gtcaaaatta aaagatcgtg cagaggaata 840 tgctgccctt atcatggaggaactggatcc ggacaacctt ggatacattg agttatacaa 900 cttggaaatg cctcgtgc 918<210> SEQ ID NO 24 <211> LENGTH: 291 <212> TYPE: PRT <213> ORGANISM:Glycine max <400> SEQUENCE: 24 Met Glu Ile His Glu Asn Gln His Glu SerTrp Ser Glu Thr Glu Ser 1 5 10 15 Thr Gly Ser Arg Ser Arg Arg Val GlyPhe Ser Gly Pro Leu Ser Gly 20 25 30 Pro Leu Ser Gly Pro Leu Ser Gly ProLeu Val Ser Ser Asn Lys Arg 35 40 45 Asn Ser Ser Lys Asn Lys Ser Ala ArgPhe Lys Asp Asp Glu Glu Met 50 55 60 Val Glu Ile Thr Leu Asp Val Arg AspAsp Ala Val Ser Val Gln Asn 65 70 75 80 Ile Arg Gly Gly Asp Ser Glu ThrAla Phe Leu Ala Ser Arg Leu Glu 85 90 95 Met Arg Pro Ser Ser Phe Ser AspArg Leu Arg Gln Val Ser Arg Glu 100 105 110 Leu Lys Arg Met Thr Ser AsnLys Ala Phe Asp Arg Val Asp Arg Ser 115 120 125 Lys Ser Gly Ala Ala ArgAla Leu Gly Gly Leu Lys Phe Met Thr Lys 130 135 140 Ala Gly Thr Glu GlyTrp Ser Gln Val Glu Lys Arg Phe Asp Glu Leu 145 150 155 160 Ala Ile AspAla Lys Leu Pro Lys Thr Arg Phe Ser Gln Cys Ile Gly 165 170 175 Met AsnGlu Ser Lys Glu Phe Ala Gly Glu Leu Phe Asp Ala Leu Ala 180 185 190 ArgArg Arg Gly Ile Thr Ser Ala Ser Ile Thr Lys Asp Gln Leu Arg 195 200 205Glu Phe Trp Glu Gln Ile Thr Asp Gln Ser Phe Asp Ser Arg Leu Gln 210 215220 Thr Phe Phe Asp Met Val Asp Lys Asp Ala Asp Gly Arg Ile Asn Glu 225230 235 240 Glu Glu Val Lys Glu Ile Ile Thr Leu Ser Ala Ser Ala Asn LysLeu 245 250 255 Ser Lys Leu Lys Asp Arg Ala Glu Glu Tyr Ala Ala Leu IleMet Glu 260 265 270 Glu Leu Asp Pro Asp Asn Leu Gly Tyr Ile Glu Leu TyrAsn Leu Glu 275 280 285 Met Pro Arg 290 <210> SEQ ID NO 25 <211> LENGTH:876 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 25gcacgaggaa tacgatgtgc tcctgctcat tggactgggc attggagcca ctccattgat 60tagcattgtg aaggatgtgc ttaaccacac ccagcatgga ggatctgttt caggcacgga 120gcctgagggc agtggcaagg ccaagaagag gccattcatg acgaagaggg cctacttcta 180ctgggtgacc agagaagagg gatctttcga atggttccga ggggtcatga acgaggtggc 240tgagaaggac aaggatggag tcattgaact ccacaaccac tgctcgagtg tgtacgagga 300aggggatgca cgttctgcac tcattgtcat gctccaagag ctccagcatg cgaagaaagg 360ggtcgacatc ttgtctggaa ctagtgtcaa gacgcacttc gcgcgtccca attggcgaag 420cgtcttcaaa catgttgcag tgaaccatga gaaccaacgc gttggagttt tctactgcgg 480tgagcctgtc cttgtgccac agctacggca gtggtcagca gacttcaccc acaagacgaa 540cacaaagttt gagttccaca aggagaactt ctaatggcat aactggagaa gctctggacg 600tggtcctcta atgatttcag aatgtgaatt acgagggttg taatatatat acgagtagaa 660taaaatgatg ggagttattt tgttgtagat gccaaaagtc aaccatgttc tttttgcaat 720gtgctctgtc catccctgtt tactcctacc aggagatata ctctgtcgtc taggcgaagt 780agataaagtg tctcttgcat acaagcaaca ttagaagttg taacacatgc gaacatgtca 840tttcgttttt agaccatttg atagtcaaaa aaaaaa 876 <210> SEQ ID NO 26 <211>LENGTH: 190 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400>SEQUENCE: 26 His Glu Glu Tyr Asp Val Leu Leu Leu Ile Gly Leu Gly Ile GlyAla 1 5 10 15 Thr Pro Leu Ile Ser Ile Val Lys Asp Val Leu Asn His ThrGln His 20 25 30 Gly Gly Ser Val Ser Gly Thr Glu Pro Glu Gly Ser Gly LysAla Lys 35 40 45 Lys Arg Pro Phe Met Thr Lys Arg Ala Tyr Phe Tyr Trp ValThr Arg 50 55 60 Glu Glu Gly Ser Phe Glu Trp Phe Arg Gly Val Met Asn GluVal Ala 65 70 75 80 Glu Lys Asp Lys Asp Gly Val Ile Glu Leu His Asn HisCys Ser Ser 85 90 95 Val Tyr Glu Glu Gly Asp Ala Arg Ser Ala Leu Ile ValMet Leu Gln 100 105 110 Glu Leu Gln His Ala Lys Lys Gly Val Asp Ile LeuSer Gly Thr Ser 115 120 125 Val Lys Thr His Phe Ala Arg Pro Asn Trp ArgSer Val Phe Lys His 130 135 140 Val Ala Val Asn His Glu Asn Gln Arg ValGly Val Phe Tyr Cys Gly 145 150 155 160 Glu Pro Val Leu Val Pro Gln LeuArg Gln Trp Ser Ala Asp Phe Thr 165 170 175 His Lys Thr Asn Thr Lys PheGlu Phe His Lys Glu Asn Phe 180 185 190 <210> SEQ ID NO 27 <211> LENGTH:553 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220> FEATURE: <221>NAME/KEY: unsure <222> LOCATION: (391) <221> NAME/KEY: unsure <222>LOCATION: (442) <221> NAME/KEY: unsure <222> LOCATION: (453) <221>NAME/KEY: unsure <222> LOCATION: (519) <221> NAME/KEY: unsure <222>LOCATION: (552) <400> SEQUENCE: 27 gtttaaaccc ctttgattag cattgtgaaggacgtgctta accacattca aggtgaggga 60 tcagttggaa ccacggagcc ggagagcagcagcaaggcga agaagaaacc tttcatgacg 120 aagagagcct acttctactg ggtgacgagagaggagggct cgtttgagtg gttcagaggc 180 gtcatgaacg aggtgtctga gaaggacaaggatggagtca ttgagctcca taaccactgc 240 tcaagcgtgt accaggaagg cgatgctcgttctgctctca ttgtcatgct ccaagaactt 300 cagcatgcga agaagggcgt cgatatcttgtcgggaacta gtgtgaagac ccatttcgca 360 cgacctaatt ggcgaagcgt cttcaaagaangttgcggtc aaccatgaga accagcgcgt 420 ccgtgtgttc tactgtggtg ancctgtgctggntcccaac taaggcattt gtcagcagat 480 ttcacccaca agacaaacac aagatttgatttccacaang gagaacttct aatggtacaa 540 atttgagaaa tnc 553 <210> SEQ ID NO28 <211> LENGTH: 174 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <220>FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (132) <221> NAME/KEY:UNSURE <222> LOCATION: (149) <221> NAME/KEY: UNSURE <222> LOCATION:(153)..(154) <400> SEQUENCE: 28 Thr Pro Leu Ile Ser Ile Val Lys Asp ValLeu Asn His Ile Gln Gly 1 5 10 15 Glu Gly Ser Val Gly Thr Thr Glu ProGlu Ser Ser Ser Lys Ala Lys 20 25 30 Lys Lys Ala Lys Lys Lys Pro Phe MetThr Lys Arg Ala Tyr Phe Tyr 35 40 45 Trp Val Thr Arg Glu Glu Gly Ser PheGlu Trp Phe Arg Gly Val Met 50 55 60 Asn Glu Val Ser Glu Lys Asp Lys AspGly Val Ile Glu Leu His Asn 65 70 75 80 His Cys Ser Ser Val Tyr Gln GluGly Asp Ala Arg Ser Ala Leu Ile 85 90 95 Val Met Leu Gln Glu Leu Gln HisAla Lys Lys Gly Val Asp Ile Leu 100 105 110 Ser Gly Thr Ser Val Lys ThrHis Phe Ala Arg Pro Asn Trp Arg Ser 115 120 125 Val Phe Lys Xaa Val AlaVal Asn His Glu Asn Gln Arg Val Arg Val 130 135 140 Phe Tyr Cys Gly XaaPro Val Leu Xaa Xaa Gln Leu Arg His Leu Ser 145 150 155 160 Ala Asp PheThr His Lys Thr Asn Thr Arg Phe Asp Phe His 165 170 <210> SEQ ID NO 29<211> LENGTH: 685 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <400>SEQUENCE: 29 ggcacgaggt ttaaacccct ttgattagca ttgtgaagga cgtgcttaaccacattcaag 60 gtgagggatc agttggaacc acggagccgg agagcagcag caaggcgaagaagaaacctt 120 tcatgacgaa gagagcctac ttctactggg tgacgagaga ggagggctcgtttgagtggt 180 tcagaggcgt catgaacgag gtgtctgaga aggacaagga tggagtcattgagctccata 240 accactgctc aagcgtgtac caggaaggcg atgctcgttc tgctctcattgtcatgctcc 300 aagaacttca gcatgcgaag aagggcgtcg atatcttgtc gggaactagtgtgaagaccc 360 atttcgcacg acctaattgg cgaagcgtct tcaagaaggt tgcggtcaaccatgagaacc 420 agcgcgtcgg tgtgttctac tgtggtgagc ctgtgctggt tccccaactaaggcagttgt 480 cagcagattt cacccacaag acaaacacaa gatttgattt ccacaaggagaacttctaat 540 ggtacaaatt gagaaatacc cgtgtatggt tttgtatgta gttctttatcatgtgaatta 600 tatggtttct aatatatata aagttggaca aaataaatga aatgatggaagctattttgt 660 tttttaaaaa aaaaaaaaaa aaaaa 685 <210> SEQ ID NO 30 <211>LENGTH: 174 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE:30 Thr Pro Leu Ile Ser Ile Val Lys Asp Val Leu Asn His Ile Gln Gly 1 510 15 Glu Gly Ser Val Gly Thr Thr Glu Pro Glu Ser Ser Ser Lys Ala Lys 2025 30 Lys Lys Pro Phe Met Thr Lys Arg Ala Tyr Phe Tyr Trp Val Thr Arg 3540 45 Glu Glu Gly Ser Phe Glu Trp Phe Arg Gly Val Met Asn Glu Val Ser 5055 60 Glu Lys Asp Lys Asp Gly Val Ile Glu Leu His Asn His Cys Ser Ser 6570 75 80 Val Tyr Gln Glu Gly Asp Ala Arg Ser Ala Leu Ile Val Met Leu Gln85 90 95 Glu Leu Gln His Ala Lys Lys Gly Val Asp Ile Leu Ser Gly Thr Ser100 105 110 Val Lys Thr His Phe Ala Arg Pro Asn Trp Arg Ser Val Phe LysLys 115 120 125 Val Ala Val Asn His Glu Asn Gln Arg Val Gly Val Phe TyrCys Gly 130 135 140 Glu Pro Val Leu Val Pro Gln Leu Arg Gln Leu Ser AlaAsp Phe Thr 145 150 155 160 His Lys Thr Asn Thr Arg Phe Asp Phe His LysGlu Asn Phe 165 170 <210> SEQ ID NO 31 <211> LENGTH: 734 <212> TYPE: DNA<213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (9) <221> NAME/KEY: unsure <222> LOCATION: (503) <221>NAME/KEY: unsure <222> LOCATION: (612) <221> NAME/KEY: unsure <222>LOCATION: (634) <221> NAME/KEY: unsure <222> LOCATION: (678) <221>NAME/KEY: unsure <222> LOCATION: (697) <221> NAME/KEY: unsure <222>LOCATION: (717) <400> SEQUENCE: 31 gatcaacanc gcatagctcg aaacttagcaaagctcttag catgaagctt gcatcaaaca 60 aggacacagg tccattttac cactactggcaagagttcat gtacttcctt gaggagaact 120 ggaagcgcat ttgggttatg actctctggctctcaatctg cattggcctc tttatttgga 180 agttcatcca ataccgtaat cgagcagtatttcacatcat gggttattgt gtgaccactg 240 caaaaggtgc tgcagagact ctcaaattcaatatggccct ggttcttttt cctgtttgcc 300 gaaatacaat cacttggatt cgatcgaagacaaagatcgg agctgttgta cccttcaatg 360 ataacataaa cttccataag gtaatagcagcaggtgttgc agttggtgtt gctttgcatg 420 caggtgctca cctgacatgt gattttcctcggctgctcca tgcaagtgat gctgcctatg 480 aaccaatgaa gcctttcttt ggngacaaaaggccaccaaa ttactggtgg tttgtaaagg 540 gaactgaagg gtggacaggt gtggtcatggatgtacttaa gactatagcc ttcgtattgg 600 cccaaccatg gnttcggcgt aataagctcaaggnttctaa tcccctcaag aaaatgactg 660 gcttcaatgc cttttggntt acgcaccacttaattgntaa tgtgtatgca ctggccnttg 720 tccaagggga tttg 734 <210> SEQ ID NO32 <211> LENGTH: 234 <212> TYPE: PRT <213> ORGANISM: Zea mays <220>FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (196) <221> NAME/KEY:UNSURE <222> LOCATION: (203) <221> NAME/KEY: UNSURE <222> LOCATION:(218) <221> NAME/KEY: UNSURE <222> LOCATION: (224) <221> NAME/KEY:UNSURE <222> LOCATION: (231) <400> SEQUENCE: 32 Leu Ser Lys Ala Leu SerMet Lys Leu Ala Ser Asn Lys Asp Thr Gly 1 5 10 15 Pro Phe Tyr His TyrTrp Gln Glu Phe Met Tyr Phe Leu Glu Glu Asn 20 25 30 Trp Lys Arg Ile TrpVal Met Thr Leu Trp Leu Ser Ile Cys Ile Gly 35 40 45 Leu Phe Ile Trp LysPhe Ile Gln Tyr Arg Asn Arg Ala Val Phe His 50 55 60 Ile Met Gly Tyr CysVal Thr Thr Ala Lys Gly Ala Ala Glu Thr Leu 65 70 75 80 Lys Phe Asn MetAla Leu Val Leu Phe Pro Val Cys Arg Asn Thr Ile 85 90 95 Thr Trp Ile ArgSer Lys Thr Lys Ile Gly Ala Val Val Pro Phe Asn 100 105 110 Asp Asn IleAsn Phe His Lys Val Ile Ala Ala Gly Val Ala Val Gly 115 120 125 Val AlaLeu His Ala Gly Ala His Leu Thr Cys Asp Phe Pro Arg Leu 130 135 140 LeuHis Ala Ser Asp Ala Ala Tyr Glu Pro Met Lys Pro Phe Phe Gly 145 150 155160 Asp Arg Pro Pro Asn Tyr Trp Trp Phe Val Lys Gly Thr Glu Gly Trp 165170 175 Thr Gly Val Val Met Asp Val Leu Lys Thr Ile Ala Phe Val Leu Ala180 185 190 Gln Pro Trp Xaa Arg Arg Asn Lys Leu Lys Xaa Ser Asn Pro LeuLys 195 200 205 Lys Met Thr Gly Phe Asn Ala Phe Trp Xaa Thr His His LeuIle Xaa 210 215 220 Asn Val Tyr Ala Leu Ala Xaa Val Gln Gly 225 230<210> SEQ ID NO 33 <211> LENGTH: 600 <212> TYPE: DNA <213> ORGANISM:Oryza sativa <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (399)<221> NAME/KEY: unsure <222> LOCATION: (415) <221> NAME/KEY: unsure<222> LOCATION: (440) <221> NAME/KEY: unsure <222> LOCATION: (453) <221>NAME/KEY: unsure <222> LOCATION: (467) <221> NAME/KEY: unsure <222>LOCATION: (473) <221> NAME/KEY: unsure <222> LOCATION: (484) <221>NAME/KEY: unsure <222> LOCATION: (486) <221> NAME/KEY: unsure <222>LOCATION: (490) <221> NAME/KEY: unsure <222> LOCATION: (494) <221>NAME/KEY: unsure <222> LOCATION: (499) <221> NAME/KEY: unsure <222>LOCATION: (509) <221> NAME/KEY: unsure <222> LOCATION: (544) <221>NAME/KEY: unsure <222> LOCATION: (548) <221> NAME/KEY: unsure <222>LOCATION: (554)..(555) <221> NAME/KEY: unsure <222> LOCATION: (558)<221> NAME/KEY: unsure <222> LOCATION: (567) <221> NAME/KEY: unsure<222> LOCATION: (576) <221> NAME/KEY: unsure <222> LOCATION: (581) <221>NAME/KEY: unsure <222> LOCATION: (585) <221> NAME/KEY: unsure <222>LOCATION: (594) <400> SEQUENCE: 33 gtggttcgtg aaggggacgg aggggtggacggggctggtg atgctggtgc tcatggcggt 60 ggcgttcacc ctcgccacgc cgtggttccgccgcggccgc ctccgcctcc cccgcccgct 120 caaccgcctc acggggttca acgccttctggtactcccac cactgcttcg tcatcgtcta 180 cgccctcctc atcgtccacg gctactacctcttccttacc aaggattggt acaagaaaac 240 gacgtggatg tacctggcgg tgccgatgttcctgtacgcg tgcgagaggc tgacgagggc 300 gctccggtcg agcgtgaggc cagtgaagatatcaagttgc ggtgtacccc ggaaatgtgt 360 gtcgctgact tctccaagcc aagggtttcaagacaagant ggcagacatt tcgtnactgt 420 gcgccgtctg ccgttcatgn accattctcatangtggcca cagacgntag tangtcaata 480 gagntngtan tganaggant aagaagttntaaggtttcgg cacgacgaag aaaacggttt 540 cggngagnaa acgnnggnat acaacangtccaagtntata nggcnaggaa cggnagaaaa 600 <210> SEQ ID NO 34 <211> LENGTH:120 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 34 TrpPhe Val Lys Gly Thr Glu Gly Trp Thr Gly Leu Val Met Leu Val 1 5 10 15Leu Met Ala Val Ala Phe Thr Leu Ala Thr Pro Trp Phe Arg Arg Gly 20 25 30Arg Leu Arg Leu Pro Arg Pro Leu Asn Arg Leu Thr Gly Phe Asn Ala 35 40 45Phe Trp Tyr Ser His His Cys Phe Val Ile Val Tyr Ala Leu Leu Ile 50 55 60Val His Gly Tyr Tyr Leu Phe Leu Thr Lys Asp Trp Tyr Lys Lys Thr 65 70 7580 Thr Trp Met Tyr Leu Ala Val Pro Met Phe Leu Tyr Ala Cys Glu Arg 85 9095 Leu Thr Arg Ala Leu Arg Ser Ser Val Arg Pro Val Lys Ile Ser Gln 100105 110 Val Ala Val Tyr Pro Gly Asn Val 115 120 <210> SEQ ID NO 35 <211>LENGTH: 549 <212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE:<221> NAME/KEY: unsure <222> LOCATION: (467) <221> NAME/KEY: unsure<222> LOCATION: (520) <221> NAME/KEY: unsure <222> LOCATION: (534) <221>NAME/KEY: unsure <222> LOCATION: (549) <400> SEQUENCE: 35 aacaaactgtcaaacatcca gaagcaagca gaggaatatg cggctttgat tatggaagaa 60 ttggaccctgatgacacagg atacatcatg atagacaacc tggagacgct cttgttgcat 120 ggaccagaggaaactacaag aggagaaagt aagtacctga gccaaatgct aagtcaaaaa 180 cttaagcctacatttgcgga cagtgcagtt atgaggtggt gtagagatgc caagtacttc 240 ttgctggacaactggcaaag atcttgggta cttgcacttt ggattggtgt gatgtttggc 300 ctatttgcctataaatttgt gcaatatagg agaaaagctg cctatgaggt gatgggccat 360 tgtgtatgcatggcaaaagg tgcagccgag acacttaaat tgaacatggt cttatcttgt 420 tacctgtttgccgcaacacc catcacctgg ctcaggaata agaccangct aggtgtgtag 480 ttcctttggatgacaacatc aacttccaca aggtatagcn gtggcatagc agtngctgtt 540 gctgtgaan 549<210> SEQ ID NO 36 <211> LENGTH: 159 <212> TYPE: PRT <213> ORGANISM:Glycine max <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (48)<221> NAME/KEY: UNSURE <222> LOCATION: (157) <400> SEQUENCE: 36 Asn LysLeu Ser Asn Ile Gln Lys Gln Ala Glu Glu Tyr Ala Ala Leu 1 5 10 15 IleMet Glu Glu Leu Asp Pro Asp Asp Thr Gly Tyr Ile Met Ile Asp 20 25 30 AsnLeu Glu Thr Leu Leu Leu His Gly Pro Glu Glu Thr Thr Arg Xaa 35 40 45 GlyGlu Ser Lys Tyr Leu Ser Gln Met Leu Ser Gln Lys Leu Lys Pro 50 55 60 ThrPhe Ala Asp Ser Ala Val Met Arg Trp Cys Arg Asp Ala Lys Tyr 65 70 75 80Phe Leu Leu Asp Asn Trp Gln Arg Ser Trp Val Leu Ala Leu Trp Ile 85 90 95Gly Val Met Phe Gly Leu Phe Ala Tyr Lys Phe Val Gln Tyr Arg Arg 100 105110 Lys Ala Ala Tyr Glu Val Met Gly His Cys Val Cys Met Ala Lys Gly 115120 125 Ala Ala Glu Thr Leu Lys Leu Asn Met Val Leu Ser Cys Tyr Leu Phe130 135 140 Ala Ala Thr Pro Ile Thr Trp Leu Arg Asn Lys Thr Xaa Leu Gly145 150 155 <210> SEQ ID NO 37 <211> LENGTH: 483 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (2) <221> NAME/KEY: unsure <222> LOCATION: (10) <221>NAME/KEY: unsure <222> LOCATION: (77) <221> NAME/KEY: unsure <222>LOCATION: (181) <221> NAME/KEY: unsure <222> LOCATION: (245) <221>NAME/KEY: unsure <222> LOCATION: (265) <221> NAME/KEY: unsure <222>LOCATION: (273) <221> NAME/KEY: unsure <222> LOCATION: (285) <221>NAME/KEY: unsure <222> LOCATION: (324) <221> NAME/KEY: unsure <222>LOCATION: (327) <221> NAME/KEY: unsure <222> LOCATION: (366) <221>NAME/KEY: unsure <222> LOCATION: (369) <221> NAME/KEY: unsure <222>LOCATION: (373) <221> NAME/KEY: unsure <222> LOCATION: (385) <221>NAME/KEY: unsure <222> LOCATION: (392) <221> NAME/KEY: unsure <222>LOCATION: (402) <221> NAME/KEY: unsure <222> LOCATION: (438) <221>NAME/KEY: unsure <222> LOCATION: (445) <221> NAME/KEY: unsure <222>LOCATION: (447) <221> NAME/KEY: unsure <222> LOCATION: (457) <221>NAME/KEY: unsure <222> LOCATION: (462) <221> NAME/KEY: unsure <222>LOCATION: (464) <221> NAME/KEY: unsure <222> LOCATION: (481) <221>NAME/KEY: unsure <222> LOCATION: (483) <400> SEQUENCE: 37 gntgagccancacctccggc cgacggggga gcccaacccg ctccggcgct ggtaccgccg 60 cgccagctacttcctcnagg acaactggcg gcgctgctgg gtgatcctgc tgtggctctc 120 catctgcgtgggcctcttca cgtggaagtt catgcagtac cgcgagcgcg ccgtgttcaa 180 ngtgatgggctactgcgtgt gcgtgggcaa aggcggcgcc gagacgctca agttcaacat 240 ggggntcatcctgctccccg tgtgncgcaa aancaacaac gtggntccgc aaacgcaccg 300 ccgcgggccggttcgtggcg ttcnacnaca acatcaactt ccacaaaggg attccccgcg 360 gggatntcngtcngggcggg ggtgnaaaac anctcccaat tntcgtgcga atttccgcgc 420 ctgctggacgccaacgangg gaggngnaag aagccangaa ancngttctt tcggggatta 480 ncn 483 <210>SEQ ID NO 38 <211> LENGTH: 143 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (2) <221>NAME/KEY: UNSURE <222> LOCATION: (25) <221> NAME/KEY: UNSURE <222>LOCATION: (59) <221> NAME/KEY: UNSURE <222> LOCATION: (81) <221>NAME/KEY: UNSURE <222> LOCATION: (87) <221> NAME/KEY: UNSURE <222>LOCATION: (90) <221> NAME/KEY: UNSURE <222> LOCATION: (94) <221>NAME/KEY: UNSURE <222> LOCATION: (107)..(108) <221> NAME/KEY: UNSURE<222> LOCATION: (120) <221> NAME/KEY: UNSURE <222> LOCATION: (123) <221>NAME/KEY: UNSURE <222> LOCATION: (127) <221> NAME/KEY: UNSURE <222>LOCATION: (129) <221> NAME/KEY: UNSURE <222> LOCATION: (132) <400>SEQUENCE: 38 Ser Xaa His Leu Arg Pro Thr Gly Glu Pro Asn Pro Leu Arg ArgTrp 1 5 10 15 Tyr Arg Arg Ala Ser Tyr Phe Leu Xaa Asp Asn Trp Arg ArgCys Trp 20 25 30 Val Ile Leu Leu Trp Leu Ser Ile Cys Val Gly Leu Phe ThrTrp Lys 35 40 45 Phe Met Gln Tyr Arg Glu Arg Ala Val Phe Xaa Val Met GlyTyr Cys 50 55 60 Val Cys Val Gly Lys Gly Gly Ala Glu Thr Leu Lys Phe AsnMet Gly 65 70 75 80 Xaa Ile Leu Leu Pro Cys Xaa Ala Lys Xaa Thr Thr TrpXaa Arg Lys 85 90 95 Arg Thr Ala Ala Gly Arg Phe Val Ala Phe Xaa Xaa AsnIle Asn Phe 100 105 110 His Lys Gly Phe Pro Ala Gly Xaa Ser Val Xaa AlaGly Val Xaa Asn 115 120 125 Xaa Ser Gln Xaa Ser Cys Glu Phe Pro Arg LeuLeu Asp Ala Asn 130 135 140 <210> SEQ ID NO 39 <211> LENGTH: 1631 <212>TYPE: DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 39 gcacgaggtggttcgtgaag gggacggagg ggtggacggg gctggtgatg ctggtgctca 60 tggcggtggcgttcaccctc gccacgccgt ggttccgccg cggccgcctc cgcctccccc 120 gcccgctcaaccgcctcacg gggttcaacg ccttctggta ctcccaccac tgcttcgtca 180 tcgtctacgccctcctcatc gtccacggct actacctctt ccttaccaag gattggtaca 240 agaaaacgacgtggatgtac ctggcggtgc cgatgttcct gtacgcgtgc gagaggctga 300 cgagggcgctccggtcgagc gtgaggccag tgaagatact caaggttgcg gtgtaccccg 360 gaaatgtgctgtcgctgcac ttctccaagc cacagggttt caagtacaag agtgggcagt 420 acatcttcgtcaactgtgcc gccgtctcgc cgttccaatg gcacccattc tccatcacgt 480 cggccccacaggacgactac gtcagcgtcc acatcaggac gctcggtgac tggacacggg 540 agcttaagaacgtcttctca agggtctgcc ggccaccgac ggaagggaag agcgggttgc 600 tccgggcggagtacgaccgc gacggcgcca tgaccaaccc aagcttcccc aaggtgctta 660 tcgacgggccgtacggcgca ccggcgcagg actacaagca gtacgacatc gtcctcctcg 720 tcggcctcgggatcggcgcc accccgatga tctccatcat caaggacatc ataaacaaca 780 tgaggcagctggacggcgac ctcgaggacg gcgacggcaa cgataactcg gtgtcgtcgt 840 cgtcggcggcgttcaagacg cggcgcgcct acttctactg ggtgacgcgg gagcaggggt 900 cgttcgagtggttccggggg gtgatggacg aggtggcgga gacggacaag aagggggtga 960 tcgagctgcacaactactgc accagcgtgt acgaggaagg ggacgcccgg tcggcgctca 1020 tcgctatgctccagtcgctc aaccacgcca agcacggcgt cgacgtcgtc tccggcaccc 1080 gcgtcaagacccacttcgcc cgccccaact ggcgcaacgt ctacaagcgc atcgccctca 1140 accaccgcgaccaacgcgtc ggggtgttct actgtggcgc gccggtgctg acgaaggaac 1200 tgcgtgagctcgctcaagat ttctcgagaa agacgagcac gaaattcgac ttccacaagg 1260 agaatttctagttatctgga atcaaaacca aagttttcgc acggccatgt ttagtacata 1320 caaagttctatacatatgac aagtatgatg acatacatat tggaaatgta gagggattag 1380 atcaaagtaggtattgcttg attgtggcca ggcttggcca gataatttca tcggtttttt 1440 gctctggaagaataatccaa tgcccccctt tgtacagatc ttctcccaga taatactttg 1500 taatacttagagtagccaat ttgataaaat cagtttgtat ctagtaacat gtagagagtt 1560 tcatggaggcctaatcaggt caaaaatatc acaaatgttt ggccaagaac aagaaaaaaa 1620 aaaaaaaaaa a1631 <210> SEQ ID NO 40 <211> LENGTH: 422 <212> TYPE: PRT <213>ORGANISM: Oryza sativa <400> SEQUENCE: 40 Thr Arg Trp Phe Val Lys GlyThr Glu Gly Trp Thr Gly Leu Val Met 1 5 10 15 Leu Val Leu Met Ala ValAla Phe Thr Leu Ala Thr Pro Trp Phe Arg 20 25 30 Arg Gly Arg Leu Arg LeuPro Arg Pro Leu Asn Arg Leu Thr Gly Phe 35 40 45 Asn Ala Phe Trp Tyr SerHis His Cys Phe Val Ile Val Tyr Ala Leu 50 55 60 Leu Ile Val His Gly TyrTyr Leu Phe Leu Thr Lys Asp Trp Tyr Lys 65 70 75 80 Lys Thr Thr Trp MetTyr Leu Ala Val Pro Met Phe Leu Tyr Ala Cys 85 90 95 Glu Arg Leu Thr ArgAla Leu Arg Ser Ser Val Arg Pro Val Lys Ile 100 105 110 Leu Lys Val AlaVal Tyr Pro Gly Asn Val Leu Ser Leu His Phe Ser 115 120 125 Lys Pro GlnGly Phe Lys Tyr Lys Ser Gly Gln Tyr Ile Phe Val Asn 130 135 140 Cys AlaAla Val Ser Pro Phe Gln Trp His Pro Phe Ser Ile Thr Ser 145 150 155 160Ala Pro Gln Asp Asp Tyr Val Ser Val His Ile Arg Thr Leu Gly Asp 165 170175 Trp Thr Arg Glu Leu Lys Asn Val Phe Ser Arg Val Cys Arg Pro Pro 180185 190 Thr Glu Gly Lys Ser Gly Leu Leu Arg Ala Glu Tyr Asp Arg Asp Gly195 200 205 Ala Met Thr Asn Pro Ser Phe Pro Lys Val Leu Ile Asp Gly ProTyr 210 215 220 Gly Ala Pro Ala Gln Asp Tyr Lys Gln Tyr Asp Ile Val LeuLeu Val 225 230 235 240 Gly Leu Gly Ile Gly Ala Thr Pro Met Ile Ser IleIle Lys Asp Ile 245 250 255 Ile Asn Asn Met Arg Gln Leu Asp Gly Asp LeuGlu Asp Gly Asp Gly 260 265 270 Asn Asp Asn Ser Val Ser Ser Ser Ser AlaAla Phe Lys Thr Arg Arg 275 280 285 Ala Tyr Phe Tyr Trp Val Thr Arg GluGln Gly Ser Phe Glu Trp Phe 290 295 300 Arg Gly Val Met Asp Glu Val AlaGlu Thr Asp Lys Lys Gly Val Ile 305 310 315 320 Glu Leu His Asn Tyr CysThr Ser Val Tyr Glu Glu Gly Asp Ala Arg 325 330 335 Ser Ala Leu Ile AlaMet Leu Gln Ser Leu Asn His Ala Lys His Gly 340 345 350 Val Asp Val ValSer Gly Thr Arg Val Lys Thr His Phe Ala Arg Pro 355 360 365 Asn Trp ArgAsn Val Tyr Lys Arg Ile Ala Leu Asn His Arg Asp Gln 370 375 380 Arg ValGly Val Phe Tyr Cys Gly Ala Pro Val Leu Thr Lys Glu Leu 385 390 395 400Arg Glu Leu Ala Gln Asp Phe Ser Arg Lys Thr Ser Thr Lys Phe Asp 405 410415 Phe His Lys Glu Asn Phe 420 <210> SEQ ID NO 41 <211> LENGTH: 2365<212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 41gcacgagaac aaactgtcaa acatccagaa gcaagcagag gaatatgcgg ctttgattat 60ggaagaattg gaccctgatg acacaggata catcatgata gacaacctgg agacgctctt 120gttgcatgga ccagaggaaa ctacaagagg agaaagtaag tacctgagcc aaatgctaag 180tcaaaaactt aagcctacat ttgcggacag tgcagttatg aggtggtgta gagatgccaa 240gtacttcttg ctggacaact ggcaaagatc ttgggtactt gcactttgga ttggtgtgat 300gtttggccta tttgcctata aatttgtgca atataggaga aaagctgcct atgaggtgat 360gggccattgt gtatgcatgg caaaaggtgc agccgagaca cttaaattga acatggctct 420tatcttgtta cctgtttgcc gcaacaccat cacctggctc aggaataaga ccaagctagg 480tgttgtagtt cctttggatg acaacatcaa cttccacaag gtaatagctg tggcaatagc 540agttgctgtt gctgtacatt ccatctatca tcttacttgt gattttcctc gccttcttca 600tgcaagcgat gaaaagtaca agctcatgca accttttttc ggagacagac catcagatta 660ttggtatttt gtcaaatcat gggaaggagt aacagggatt ataatagttg tgctaatggc 720aatagccttt acgctggcta atcctaggtt caggagaggc cgggccaaac tacccaaacc 780tttcaacaaa ttcacgggtt tcaatgcctt ttggtattcc catcacctct tcgtcattgt 840ctatgccctg ttggttgtac atggaatcaa actttacttg actaaggaat ggtacaagaa 900aacgacctgg atgtatctgg ctattcccat caccatttat gcattggaaa gactggttag 960agcattcaga tcgagcatta agtccgtcaa aatattgaag gtgactcttt atcctggaaa 1020cgtgttatca cttaaaatgt caaagccgca ggggtttagc tacaaaagtg gacaatacat 1080gtttgtgaat tgtgctgctg tgtctccatt tgaatggcat ccattttcca taacttccgc 1140ccctgatgat gattacctta gcgttcacat aaaaatactt ggtgactgga ctcgaagtct 1200gaaagccaaa ttcacacagg cgtgccagca acccctaaat ggacagagtg gacttctaag 1260agctgaatgc ttgaaaggag ataacagccc aagttccttt cctaaggttc tggtggatgg 1320tccgtatggg gcgccagcac aagactacag ggagtatgag gtggtgttgc tggtggggct 1380tggaattggg gctacaccaa tgataagtat actaaaggac atggtgaata attttaaggc 1440gaatgatgag gaggagggag ggcaagagag ggtgagtgac ttcaagacaa ggagggcata 1500cttctactgg gtgactacac atcacggttc atttgactgg ttcaaagggg taatgaacga 1560agtggcagaa gaggaccgaa ggaaagtgat tgaactccac agctactgca ccagcgtcta 1620cgaagagggt gatgctcgct ctgctcttat tgctatgttg cagtccctaa accatgcaaa 1680gaatggagtg gacattgtct ctgggacacg agtcatgtct cactttgcaa aacccaattg 1740gcgcagtgtc tacaagcgca ttgcgcttaa tcatccagat gcccgagttg gtcagttccc 1800accccttcta ctcacgttgc gtctgctttt tgcttttttc caaattcatt aaattgcaat 1860gcaggggtat tttactgtgg gccatcagcc ctcacccatg agcttcgtca gctagcattg 1920gatttctctc acaacacatc caccaagtac gacttccata aagaaaattt ctgacaagat 1980cgagctaagc cgtccaccac ctagatcatt ccactttgtt ttttgcctat aaataattaa 2040gtttctacat gtggttctta tttatggctt tgtaaaagca ccaaggcagg tgggagctaa 2100atggttaaaa tagagttgcc taccaattat aaaaggtgtg gacgtctgtt aatgtctctg 2160cgtgtcttta tgtccatgta aatttaagga gaaataacca ctccttggaa acaccttggg 2220agaaactaga aagtaataca gaagagagaa taatgtctag ctggagcgca cttgatactt 2280tgtattaatc cccgatgcct tttagctgat caattgcaga attcaggaga ttcatttata 2340aatattataa aaaaaaaaaa aaaaa 2365 <210> SEQ ID NO 42 <211> LENGTH: 616<212> TYPE: PRT <213> ORGANISM: Glycine max <400> SEQUENCE: 42 His GluAsn Lys Leu Ser Asn Ile Gln Lys Gln Ala Glu Glu Tyr Ala 1 5 10 15 AlaLeu Ile Met Glu Glu Leu Asp Pro Asp Asp Thr Gly Tyr Ile Met 20 25 30 IleAsp Asn Leu Glu Thr Leu Leu Leu His Gly Pro Glu Glu Thr Thr 35 40 45 ArgGly Glu Ser Lys Tyr Leu Ser Gln Met Leu Ser Gln Lys Leu Lys 50 55 60 ProThr Phe Ala Asp Ser Ala Val Met Arg Trp Cys Arg Asp Ala Lys 65 70 75 80Tyr Phe Leu Leu Asp Asn Trp Gln Arg Ser Trp Val Leu Ala Leu Trp 85 90 95Ile Gly Val Met Phe Gly Leu Phe Ala Tyr Lys Phe Val Gln Tyr Arg 100 105110 Arg Lys Ala Ala Tyr Glu Val Met Gly His Cys Val Cys Met Ala Lys 115120 125 Gly Ala Ala Glu Thr Leu Lys Leu Asn Met Ala Leu Ile Leu Leu Pro130 135 140 Val Cys Arg Asn Thr Ile Thr Trp Leu Arg Asn Lys Thr Lys LeuGly 145 150 155 160 Val Val Val Pro Leu Asp Asp Asn Ile Asn Phe His LysVal Ile Ala 165 170 175 Val Ala Ile Ala Val Ala Val Ala Val His Ser IleTyr His Leu Thr 180 185 190 Cys Asp Phe Pro Arg Leu Leu His Ala Ser AspGlu Lys Tyr Lys Leu 195 200 205 Met Gln Pro Phe Phe Gly Asp Arg Pro SerAsp Tyr Trp Tyr Phe Val 210 215 220 Lys Ser Trp Glu Gly Val Thr Gly IleIle Ile Val Val Leu Met Ala 225 230 235 240 Ile Ala Phe Thr Leu Ala AsnPro Arg Phe Arg Arg Gly Arg Ala Lys 245 250 255 Leu Pro Lys Pro Phe AsnLys Phe Thr Gly Phe Asn Ala Phe Trp Tyr 260 265 270 Ser His His Leu PheVal Ile Val Tyr Ala Leu Leu Val Val His Gly 275 280 285 Ile Lys Leu TyrLeu Thr Lys Glu Trp Tyr Lys Lys Thr Thr Trp Met 290 295 300 Tyr Leu AlaIle Pro Ile Thr Ile Tyr Ala Leu Glu Arg Leu Val Arg 305 310 315 320 AlaPhe Arg Ser Ser Ile Lys Ser Val Lys Ile Leu Lys Val Thr Leu 325 330 335Tyr Pro Gly Asn Val Leu Ser Leu Lys Met Ser Lys Pro Gln Gly Phe 340 345350 Ser Tyr Lys Ser Gly Gln Tyr Met Phe Val Asn Cys Ala Ala Val Ser 355360 365 Pro Phe Glu Trp His Pro Phe Ser Ile Thr Ser Ala Pro Asp Asp Asp370 375 380 Tyr Leu Ser Val His Ile Lys Ile Leu Gly Asp Trp Thr Arg SerLeu 385 390 395 400 Lys Ala Lys Phe Thr Gln Ala Cys Gln Gln Pro Leu AsnGly Gln Ser 405 410 415 Gly Leu Leu Arg Ala Glu Cys Leu Lys Gly Asp AsnSer Pro Ser Ser 420 425 430 Phe Pro Lys Val Leu Val Asp Gly Pro Tyr GlyAla Pro Ala Gln Asp 435 440 445 Tyr Arg Glu Tyr Glu Val Val Leu Leu ValGly Leu Gly Ile Gly Ala 450 455 460 Thr Pro Met Ile Ser Ile Leu Lys AspMet Val Asn Asn Phe Lys Ala 465 470 475 480 Asn Asp Glu Glu Glu Gly GlyGln Glu Arg Val Ser Asp Phe Lys Thr 485 490 495 Arg Arg Ala Tyr Phe TyrTrp Val Thr Thr His His Gly Ser Phe Asp 500 505 510 Trp Phe Lys Gly ValMet Asn Glu Val Ala Glu Glu Asp Arg Arg Lys 515 520 525 Val Ile Glu LeuHis Ser Tyr Cys Thr Ser Val Tyr Glu Glu Gly Asp 530 535 540 Ala Arg SerAla Leu Ile Ala Met Leu Gln Ser Leu Asn His Ala Lys 545 550 555 560 AsnGly Val Asp Ile Val Ser Gly Thr Arg Val Met Ser His Phe Ala 565 570 575Lys Pro Asn Trp Arg Ser Val Tyr Lys Arg Ile Ala Leu Asn His Pro 580 585590 Asp Ala Arg Val Gly Gln Phe Pro Pro Leu Leu Leu Thr Leu Arg Leu 595600 605 Leu Phe Ala Phe Phe Gln Ile His 610 615 <210> SEQ ID NO 43 <211>LENGTH: 1993 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum <400>SEQUENCE: 43 gcacgaggct gagccagcac ctccggccga cggcggagcc caacccgctccggcgctggt 60 accgccgcgc cagctacttc ctcgaggaca actggcggcg ctgctgggtgatcctgctgt 120 ggctctccat ctgcgtgggc ctcttcacgt ggaagttcat gcagtaccgcgagcgcgccg 180 tgttcaaggt gatgggctac tgcgtgtgcg tggccaaggg cggcgccgagacgctcaagt 240 tcaacatggc gctcatcctg ctccccgtgt gccgcaacac catcacgtggttccgcaacc 300 gcaccgccgc gggccggttc gtgccgttcg acgacaacat caacttccacaaggtgatcg 360 ccgcggggat ctcggtcggc gcggggctgc acatcatctc ccatttgacgtgcgacttcc 420 cgcgcctgct gcacgccacc gaggaggagt acgagcccat gaagcggttcttcggggatg 480 accagccgcc caactactgg tggttcgtca agggcacgga ggggtggacggggctggtga 540 tgctggtgct catggcgatc gccttcacgc tcgccatgcc gtggttccgccgcggcaggc 600 tcagcctccc caagccgctc aaccggctca ccgggttcaa cgccttctggtactcgcacc 660 acctcttcgt catcgtctac gcgctgctca tcgtccacgg ccacttcctctacctcacca 720 agaagtggca aaagaagtcg acgtggatgt acctggcggt gccgatggtgatgtacgcgt 780 gcgagcggct gacgcgggcg ctgcggtcga gcgtgcggcc ggtgaagatactcaaggtgg 840 cggtgtaccc cggcaacgtg ctgtcgctgc acttctccaa gccgcaggggttccggtaca 900 agagcgggca gtacatcttc gtcaactgcg ccgccgtctc gccgttccaatggcacccgt 960 tctccatcac gtcggcgccg caggacgact acgtgagcgt gcacatcaggacgctggggg 1020 actggacccg ggagctcaag aacgtcttct ccaaggtctg ccggccgccgacggagggca 1080 agagcggcct gctccgggcc gagtacgacc gcgacgtcgg cgccatgtccaacccgagct 1140 tcccaaaggt gctgatcgac ggcccctacg gcgcgccggc gcaggactacaagcaatacg 1200 acatcgtgct gctcgtgggg ctgggcatcg gggccacacc catgatctccatcatcaagg 1260 acatcatcaa caacatgaag cggctcgaag gagacgtcga gtccggcaaccccggggacg 1320 cgagcacgtc ggcgtccttc cggacccggc gcgcctactt ctactgggtgacgcgggagc 1380 aaggctcctt cgagtggttc cggggcgtca tggacgagat agctgagtcggacaagaaga 1440 gtgtcatcga gctccataac tactgcacca gtgtctacga ggacggggacgcccggtccg 1500 cgctcatcgc catgctccag tccctcaacc atgccaagaa cggcgtcgacatcgtctccg 1560 gcacccgtgt caagacccac tttgcgcgac caaactggcg caacgtctacaagcgtatcg 1620 ccctcaacca ccgtgaacag cgtgtcggag tattctactg cggtgcaccggtgctaacaa 1680 aggagctgcg tgaccttgca caagatttct cgagaaagac aaacacaaaattcgagttcc 1740 acaaggagaa tttttaactt ctgtagacga cccagccaca agaagttgctctttttgccg 1800 aaagtgtgtt aattcagtac atagcaagtt cttatacttg tgagtagagcaactagatca 1860 aaggagatat tgaatgtgtt gatttattag tacatagcaa gtcttgtacttatgaaaagt 1920 agaggggcta gatcaaagta ggtattggtt gattataaca agtttgatcagatttaaaaa 1980 aaaaaaaaaa aaa 1993 <210> SEQ ID NO 44 <211> LENGTH: 584<212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 44 ThrArg Leu Ser Gln His Leu Arg Pro Thr Ala Glu Pro Asn Pro Leu 1 5 10 15Arg Arg Trp Tyr Arg Arg Ala Ser Tyr Phe Leu Glu Asp Asn Trp Arg 20 25 30Arg Cys Trp Val Ile Leu Leu Trp Leu Ser Ile Cys Val Gly Leu Phe 35 40 45Thr Trp Lys Phe Met Gln Tyr Arg Glu Arg Ala Val Phe Lys Val Met 50 55 60Gly Tyr Cys Val Cys Val Ala Lys Gly Gly Ala Glu Thr Leu Lys Phe 65 70 7580 Asn Met Ala Leu Ile Leu Leu Pro Val Cys Arg Asn Thr Ile Thr Trp 85 9095 Phe Arg Asn Arg Thr Ala Ala Gly Arg Phe Val Pro Phe Asp Asp Asn 100105 110 Ile Asn Phe His Lys Val Ile Ala Ala Gly Ile Ser Val Gly Ala Gly115 120 125 Leu His Ile Ile Ser His Leu Thr Cys Asp Phe Pro Arg Leu LeuHis 130 135 140 Ala Thr Glu Glu Glu Tyr Glu Pro Met Lys Arg Phe Phe GlyAsp Asp 145 150 155 160 Gln Pro Pro Asn Tyr Trp Trp Phe Val Lys Gly ThrGlu Gly Trp Thr 165 170 175 Gly Leu Val Met Leu Val Leu Met Ala Ile AlaPhe Thr Leu Ala Met 180 185 190 Pro Trp Phe Arg Arg Gly Arg Leu Ser LeuPro Lys Pro Leu Asn Arg 195 200 205 Leu Thr Gly Phe Asn Ala Phe Trp TyrSer His His Leu Phe Val Ile 210 215 220 Val Tyr Ala Leu Leu Ile Val HisGly His Phe Leu Tyr Leu Thr Lys 225 230 235 240 Lys Trp Gln Lys Lys SerThr Trp Met Tyr Leu Ala Val Pro Met Val 245 250 255 Met Tyr Ala Cys GluArg Leu Thr Arg Ala Leu Arg Ser Ser Val Arg 260 265 270 Pro Val Lys IleLeu Lys Val Ala Val Tyr Pro Gly Asn Val Leu Ser 275 280 285 Leu His PheSer Lys Pro Gln Gly Phe Arg Tyr Lys Ser Gly Gln Tyr 290 295 300 Ile PheVal Asn Cys Ala Ala Val Ser Pro Phe Gln Trp His Pro Phe 305 310 315 320Ser Ile Thr Ser Ala Pro Gln Asp Asp Tyr Val Ser Val His Ile Arg 325 330335 Thr Leu Gly Asp Trp Thr Arg Glu Leu Lys Asn Val Phe Ser Lys Val 340345 350 Cys Arg Pro Pro Thr Glu Gly Lys Ser Gly Leu Leu Arg Ala Glu Tyr355 360 365 Asp Arg Asp Val Gly Ala Met Ser Asn Pro Ser Phe Pro Lys ValLeu 370 375 380 Ile Asp Gly Pro Tyr Gly Ala Pro Ala Gln Asp Tyr Lys GlnTyr Asp 385 390 395 400 Ile Val Leu Leu Val Gly Leu Gly Ile Gly Ala ThrPro Met Ile Ser 405 410 415 Ile Ile Lys Asp Ile Ile Asn Asn Met Lys ArgLeu Glu Gly Asp Val 420 425 430 Glu Ser Gly Asn Pro Gly Asp Ala Ser ThrSer Ala Ser Phe Arg Thr 435 440 445 Arg Arg Ala Tyr Phe Tyr Trp Val ThrArg Glu Gln Gly Ser Phe Glu 450 455 460 Trp Phe Arg Gly Val Met Asp GluIle Ala Glu Ser Asp Lys Lys Ser 465 470 475 480 Val Ile Glu Leu His AsnTyr Cys Thr Ser Val Tyr Glu Asp Gly Asp 485 490 495 Ala Arg Ser Ala LeuIle Ala Met Leu Gln Ser Leu Asn His Ala Lys 500 505 510 Asn Gly Val AspIle Val Ser Gly Thr Arg Val Lys Thr His Phe Ala 515 520 525 Arg Pro AsnTrp Arg Asn Val Tyr Lys Arg Ile Ala Leu Asn His Arg 530 535 540 Glu GlnArg Val Gly Val Phe Tyr Cys Gly Ala Pro Val Leu Thr Lys 545 550 555 560Glu Leu Arg Asp Leu Ala Gln Asp Phe Ser Arg Lys Thr Asn Thr Lys 565 570575 Phe Glu Phe His Lys Glu Asn Phe 580 <210> SEQ ID NO 45 <211> LENGTH:672 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 45caagctggta cgcctgcagg taccggtccg gaattcccgg gtcgacccac gcgtccgctg 60ccaataagct gtccaggctt aaggagcaag ccgaagagta tgccgccctg attatggagg 120agcttgatcc tgaaggactt ggctacattg agttgtggca gttggagacc cttctgttgc 180agaaggatac ctacatgaac tacagtcagg cgctaagtta cacaagccaa gcactaagtc 240agaacctagc aggcttaagg aaaaagagtc caattcgcaa gataagtacc acattgagtt 300actatttgga ggataattgg aaacgcctat gggtgcttgc attatggatt ggaataatgg 360ctggactgtt cacttggaag ttcatgcagt atcgcaacag gtatgtcttt aatgttgatg 420ggctactgtg tgaccactgc aaaaggcgct gcttgaaacc ctgaagctga acatggcaat 480tatcctcctc ccagtatgcc ggtaacacca ttactttggt tgagaaatac aagggctgca 540cgggcattgc catttgatga atatattaac ttcccaccag actattgcag cagcatcgtt 600tgttgggtat atcctcccat gccaggaacc tcctgtgttg tgaatttccc acggctataa 660ttcctccaat ga 672 <210> SEQ ID NO 46 <211> LENGTH: 199 <212> TYPE: PRT<213> ORGANISM: Zea mays <400> SEQUENCE: 46 Ala Ser Ala Ala Asn Lys LeuSer Arg Leu Lys Glu Gln Ala Glu Glu 1 5 10 15 Tyr Ala Ala Leu Ile MetGlu Glu Leu Asp Pro Glu Gly Leu Gly Tyr 20 25 30 Ile Glu Leu Trp Gln LeuGlu Thr Leu Leu Leu Gln Lys Asp Thr Tyr 35 40 45 Met Asn Tyr Ser Gln AlaLeu Ser Tyr Thr Ser Gln Ala Leu Ser Gln 50 55 60 Asn Leu Ala Gly Leu ArgLys Lys Ser Pro Ile Arg Lys Ile Ser Thr 65 70 75 80 Thr Leu Ser Tyr TyrLeu Glu Asp Asn Trp Lys Arg Leu Trp Val Leu 85 90 95 Ala Leu Trp Ile GlyIle Met Ala Gly Leu Phe Thr Trp Lys Phe Met 100 105 110 Gln Tyr Arg AsnArg Tyr Val Phe Asn Leu Met Gly Tyr Cys Val Thr 115 120 125 Thr Ala LysAla Leu Leu Glu Thr Leu Lys Leu Asn Met Ala Ile Ile 130 135 140 Leu LeuPro Val Cys Arg Leu Leu Trp Leu Arg Asn Thr Arg Ala Ala 145 150 155 160Arg Ala Leu Pro Phe Asp Glu Tyr Ile Asn Phe Pro Pro Asp Tyr Cys 165 170175 Ser Ser Ile Leu Leu Gly Ile Ser Ser His Ala Arg Asn Leu Leu Cys 180185 190 Cys Glu Phe Pro Thr Ala Ile 195 <210> SEQ ID NO 47 <211> LENGTH:432 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <220> FEATURE: <221>NAME/KEY: unsure <222> LOCATION: (346) <221> NAME/KEY: unsure <222>LOCATION: (351) <221> NAME/KEY: unsure <222> LOCATION: (368) <221>NAME/KEY: unsure <222> LOCATION: (399)..(400) <221> NAME/KEY: unsure<222> LOCATION: (406) <221> NAME/KEY: unsure <222> LOCATION: (430) <400>SEQUENCE: 47 accttgctgg tctaaggaaa aggagtccaa tccggaaaat aagcaccaaattaagctact 60 atctggagga caactggaag cgcctgtggg tacttgcact gtggattgggataatggctg 120 ggttgttcat ttggaaattc atacaatacc gccaccgata tgtctttaacgtgatgggct 180 actgtgtaac aactgcaaaa ggggctgctg agactcttaa gctgaatatggctattatcc 240 tcctgccagt atgcccgcaa caccatcact tggttgagga atacaagggctgcacgggca 300 ttgccgttcg atgacaacat caacttccac aaagactatt gcaacnacaanttgtgggtt 360 ggggttancc ttcaaggggg gggccccaac cttggtagnn tatttncccccgggctccaa 420 agggttccan cg 432 <210> SEQ ID NO 48 <211> LENGTH: 110<212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 48 Leu AlaGly Leu Arg Lys Arg Ser Pro Ile Arg Lys Ile Ser Thr Lys 1 5 10 15 LeuSer Tyr Tyr Leu Glu Asp Asn Trp Lys Arg Leu Trp Val Leu Ala 20 25 30 LeuTrp Ile Gly Ile Met Ala Gly Leu Phe Ile Trp Lys Phe Ile Gln 35 40 45 TyrArg His Arg Tyr Val Phe Asn Val Met Gly Tyr Cys Val Thr Thr 50 55 60 AlaLys Gly Ala Ala Glu Thr Leu Lys Leu Asn Met Ala Ile Ile Leu 65 70 75 80Leu Pro Val Cys Arg Asn Thr Ile Thr Trp Leu Arg Asn Thr Arg Ala 85 90 95Ala Arg Ala Leu Pro Phe Asp Asp Asn Ile Asn Phe His Lys 100 105 110<210> SEQ ID NO 49 <211> LENGTH: 558 <212> TYPE: DNA <213> ORGANISM:Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (485)<221> NAME/KEY: unsure <222> LOCATION: (504) <221> NAME/KEY: unsure<222> LOCATION: (534) <221> NAME/KEY: unsure <222> LOCATION: (540) <221>NAME/KEY: unsure <222> LOCATION: (546) <221> NAME/KEY: unsure <222>LOCATION: (551) <221> NAME/KEY: unsure <222> LOCATION: (557)..(558)<400> SEQUENCE: 49 agacactgaa gactacaaac gcttattttt actgggtaacaagagaacaa ggttctttcg 60 attggtttaa aggggtcatg aatgaagttg ctgaacttgatcaaaggggt gttattgaga 120 tgcacaacta cttaactagc gtatatgagg aaggcgatgccagatccgcc cttattacaa 180 tggtacaagc cctcaaccat gcaaaaaatg gagttgatattgtttctggg actagggtaa 240 gcctttgatt tttgcccttt actttaccct ttatgcgttttagttttgga aaaatagaag 300 tatcctaaag agtttgcttt tcagtcttga gtaagttgattttaaacctt gatttgcttc 360 tctgaattta ctatcaggtg agaactcatt ttgccagggcctaactggaa gaaggtttct 420 caagaatatg ctctaaagca ctgcaatgga cgaataaggggtttcctaat tggtggggca 480 ccggntttgg ccaaaagaat ttancaagct cctgctccgaggtcaacgga aaanggcaan 540 caaaantttg ngtcccnn 558 <210> SEQ ID NO 50<211> LENGTH: 78 <212> TYPE: PRT <213> ORGANISM: Glycine max <400>SEQUENCE: 50 Leu Lys Thr Thr Asn Ala Tyr Phe Tyr Trp Val Thr Arg Glu GlnGly 1 5 10 15 Ser Phe Asp Trp Phe Lys Gly Val Met Asn Glu Val Ala GluLeu Asp 20 25 30 Gln Arg Gly Val Ile Glu Met His Asn Tyr Leu Thr Ser ValTyr Glu 35 40 45 Glu Gly Asp Ala Arg Ser Ala Leu Ile Thr Met Val Gln AlaLeu Asn 50 55 60 His Ala Lys Asn Gly Val Asp Ile Val Ser Gly Thr Arg Val65 70 75 <210> SEQ ID NO 51 <211> LENGTH: 2224 <212> TYPE: DNA <213>ORGANISM: Zea mays <400> SEQUENCE: 51 ccacgcgtcc gctgccaata agctgtccaggcttaaggag caagccgaag agtatgccgc 60 cctgattatg gaggagcttg atcctgaaggacttggctac attgagttgt ggcagttgga 120 gacccttctg ttgcagaagg atacctacatgaactacagt caggcgctaa gttacacaag 180 ccaagcacta agtcagaacc tagcaggcttaaggaaaaag agtccaattc gcaagataag 240 taccacattg agttactatt tggaggataattggaaacgc ctatgggtgc ttgcattatg 300 gattggaata atggctggac tgttcacttggaagttcatg cagtatcgca acaggtatgt 360 ctttaatgtg atgggctact gtgtgaccactgcaaaaggc gctgctgaaa ccctgaagct 420 gaacatggca attatcctcc tcccagtatgccgtaacacc attacttggt tgagaaatac 480 aagggctgca cgggcattgc catttgatgataatattaac ttccacaaga ctattgcagc 540 agcaatcgtt gttggtataa tcctccatgcagggaaccat cttgtgtgtg attttccacg 600 gcttataaat tcatcaaatg agaaatatgctcctctgggc cagtacttcg gtgaaacaaa 660 gccaacatac tttacattgg taggagtggaaggcatcact ggtgtaatta tggttatctg 720 tatgattatt gctttcactt tagctacacggtggttccgt cgtagcctgg tgaagcttcc 780 aaaaccattt gacaaactga ctggcttcaatgccttctgg tactctcatc atttattcat 840 aattgtgtat ttggctctca ttgttcatggccagttcctt tacctcattc atgtctggta 900 ccgaaaaacg acatggatgt atctggcagtgcctgtttgc ctgtatgtag gggagagggt 960 gctgaggttc ttcaggtctg gcagttattctgtcaggcta ttgaaggtgg ccatatatcc 1020 tggtaatgtg ttgacactgc aaatgtccaagcctccggct ttccgataca agagtgggca 1080 atatatgttt gttcaatgtc cagcagtgtcaccctttgaa tggcatccgt tctcaattac 1140 ttcagcgcct ggggatgatt atctaagcattcatgttcga caacttggtg actggacacg 1200 agagctcaag agggtttttg cagcagcttgtgaaccccca gtgggcggta aaagtggcct 1260 tcttagagca gacgagacaa ccaagaaagccttaccaaaa cttctgattg atggacctta 1320 tggttctcct gctcaggatt acagcaagtatgatgttctt ctacttgttg gattagggat 1380 tggtgcaact ccttttatca gcatactgaaagatcttctt aacaatatta taaggatgga 1440 ggaagaggag gatgcttcca ctgatctttatcctccagtt ggtccgaaca agccgcacat 1500 tgatctcagc acacttatga cggtcacatcaagaccaaag agggttctaa ggaccacaaa 1560 cgcttacttc tattgggtta cacgtgagcaaggttctttt gattggttca agggagtcat 1620 gaatgaaatt gctgaactgg accaaaggaatattattgag atgcacaact accttacgag 1680 tgtttatgag gaaggagatg ctcggtcagcgctcatcacc atgcttcaag ctttgaacca 1740 tgcaaagaat ggcgttgaca ttgtctctggaactaaagtc cggacacact ttgcaagacc 1800 aaattggaaa aaagtcctat ctaaaattgcctcgaagcat ccatttgcta agataggtgt 1860 attctactgt ggagctccag ttcttgcacaagaactaaac aaactgtgcc atgaattcaa 1920 tgggaaaagc acaacgaaat ttgagttccacaaggaacat ttctgaattc cagtaatact 1980 tggtgcgcag aaatgatata ttatgtgtgtgtacagcatt tttgtcttgg tatgttcatc 2040 tggagttctg gacgaacatg agagcaaagtggagaagcta ccatcaaatg cttaacagct 2100 gacggcctgc actctttgtt aactgctcctaaatgaacca gaccaccaag aagtgggatt 2160 tttgtgtaca taaaaaaaaa aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220 aaag 2224 <210> SEQ ID NO 52 <211>LENGTH: 654 <212> TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 52His Ala Ser Ala Ala Asn Lys Leu Ser Arg Leu Lys Glu Gln Ala Glu 1 5 1015 Glu Tyr Ala Ala Leu Ile Met Glu Glu Leu Asp Pro Glu Gly Leu Gly 20 2530 Tyr Ile Glu Leu Trp Gln Leu Glu Thr Leu Leu Leu Gln Lys Asp Thr 35 4045 Tyr Met Asn Tyr Ser Gln Ala Leu Ser Tyr Thr Ser Gln Ala Leu Ser 50 5560 Gln Asn Leu Ala Gly Leu Arg Lys Lys Ser Pro Ile Arg Lys Ile Ser 65 7075 80 Thr Thr Leu Ser Tyr Tyr Leu Glu Asp Asn Trp Lys Arg Leu Trp Val 8590 95 Leu Ala Leu Trp Ile Gly Ile Met Ala Gly Leu Phe Thr Trp Lys Phe100 105 110 Met Gln Tyr Arg Asn Arg Tyr Val Phe Asn Val Met Gly Tyr CysVal 115 120 125 Thr Thr Ala Lys Gly Ala Ala Glu Thr Leu Lys Leu Asn MetAla Ile 130 135 140 Ile Leu Leu Pro Val Cys Arg Asn Thr Ile Thr Trp LeuArg Asn Thr 145 150 155 160 Arg Ala Ala Arg Ala Leu Pro Phe Asp Asp AsnIle Asn Phe His Lys 165 170 175 Thr Ile Ala Ala Ala Ile Val Val Gly IleIle Leu His Ala Gly Asn 180 185 190 His Leu Val Cys Asp Phe Pro Arg LeuIle Asn Ser Ser Asn Glu Lys 195 200 205 Tyr Ala Pro Leu Gly Gln Tyr PheGly Glu Thr Lys Pro Thr Tyr Phe 210 215 220 Thr Leu Val Gly Val Glu GlyIle Thr Gly Val Ile Met Val Ile Cys 225 230 235 240 Met Ile Ile Ala PheThr Leu Ala Thr Arg Trp Phe Arg Arg Ser Leu 245 250 255 Val Lys Leu ProLys Pro Phe Asp Lys Leu Thr Gly Phe Asn Ala Phe 260 265 270 Trp Tyr SerHis His Leu Phe Ile Ile Val Tyr Leu Ala Leu Ile Val 275 280 285 His GlyGln Phe Leu Tyr Leu Ile His Val Trp Tyr Arg Lys Thr Thr 290 295 300 TrpMet Tyr Leu Ala Val Pro Val Cys Leu Tyr Val Gly Glu Arg Val 305 310 315320 Leu Arg Phe Phe Arg Ser Gly Ser Tyr Ser Val Arg Leu Leu Lys Val 325330 335 Ala Ile Tyr Pro Gly Asn Val Leu Thr Leu Gln Met Ser Lys Pro Pro340 345 350 Ala Phe Arg Tyr Lys Ser Gly Gln Tyr Met Phe Val Gln Cys ProAla 355 360 365 Val Ser Pro Phe Glu Trp His Pro Phe Ser Ile Thr Ser AlaPro Gly 370 375 380 Asp Asp Tyr Leu Ser Ile His Val Arg Gln Leu Gly AspTrp Thr Arg 385 390 395 400 Glu Leu Lys Arg Val Phe Ala Ala Ala Cys GluPro Pro Val Gly Gly 405 410 415 Lys Ser Gly Leu Leu Arg Ala Asp Glu ThrThr Lys Lys Ala Leu Pro 420 425 430 Lys Leu Leu Ile Asp Gly Pro Tyr GlySer Pro Ala Gln Asp Tyr Ser 435 440 445 Lys Tyr Asp Val Leu Leu Leu ValGly Leu Gly Ile Gly Ala Thr Pro 450 455 460 Phe Ile Ser Ile Leu Lys AspLeu Leu Asn Asn Ile Ile Arg Met Glu 465 470 475 480 Glu Glu Glu Asp AlaSer Thr Asp Leu Tyr Pro Pro Val Gly Pro Asn 485 490 495 Lys Pro His IleAsp Leu Ser Thr Leu Met Thr Val Thr Ser Arg Pro 500 505 510 Lys Arg ValLeu Arg Thr Thr Asn Ala Tyr Phe Tyr Trp Val Thr Arg 515 520 525 Glu GlnGly Ser Phe Asp Trp Phe Lys Gly Val Met Asn Glu Ile Ala 530 535 540 GluLeu Asp Gln Arg Asn Ile Ile Glu Met His Asn Tyr Leu Thr Ser 545 550 555560 Val Tyr Glu Glu Gly Asp Ala Arg Ser Ala Leu Ile Thr Met Leu Gln 565570 575 Ala Leu Asn His Ala Lys Asn Gly Val Asp Ile Val Ser Gly Thr Lys580 585 590 Val Arg Thr His Phe Ala Arg Pro Asn Trp Lys Lys Val Leu SerLys 595 600 605 Ile Ala Ser Lys His Pro Phe Ala Lys Ile Gly Val Phe TyrCys Gly 610 615 620 Ala Pro Val Leu Ala Gln Glu Leu Asn Lys Leu Cys HisGlu Phe Asn 625 630 635 640 Gly Lys Ser Thr Thr Lys Phe Glu Phe His LysGlu His Phe 645 650 <210> SEQ ID NO 53 <211> LENGTH: 2480 <212> TYPE:DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 53 cactaagccagaaccttgct ggtctaagga aaaggagtcc aatccggaaa ataagcacca 60 aattaagctactatctggag gacaactgga agcgcctgtg ggtacttgca ctgtggattg 120 ggataatggctgggttgttc atttggaaat tcatacaata ccgccaccga tatgtcttta 180 acgtgatgggctactgtgta acaactgcaa aaggggctgc tgagactctt aagctgaata 240 tggctattatcctcctgcca gtatgccgca acaccatcac ttggttgagg aatacaaggg 300 ctgcacgggcattgccgttc gatgacaaca tcaacttcca caagactatt gcagcagcaa 360 ttgtggttggtgttatcctt catggagggc tccatcttgt atgtgatttt ccacggctca 420 taggttcatcggaggagaag tatgctccac tagggaagta ttttggtgaa actaagccaa 480 catatttgaccctggtcaaa ggagtggagg gcataactgg ggtaatcatg cttgtgtgca 540 tgattatcgcttttactctt gcaaccaggt ggttccgccg tagcctggtg aagcttccaa 600 agccatttgacaaattaact ggcttcaacg ctttctggta ttctcaccat ctattcatca 660 ttgtgtacatatcacttgta attcatggag agtggctata ccttatccgc atatggtaca 720 aaaggacgacatggatgtat cttgcagtgc ccgtttgttt gtatgtaggg gagaggacac 780 tgaggttcttcaggtctggc agttattctg tccgtctgtt gaaggtggcc atctatcctg 840 gtaatgttttaacactgcag atgtctaagc ctcccacatt ccgttataag agtgggcagt 900 atatgtttgttcaatgtcca gctgtttcac cctttgaatg gcatcccttc tcaataactt 960 cggcacctggggatgactat ctcagcattc atgttcggca acttggtgac tggacaagag 1020 agctcaagagggtgttctca gcggcttgtg agccaccagt gggtgggaaa agtggtctcc 1080 ttagagcagatgagactacc aagaaagcct tacccaaatt gttgattgat ggtccatatg 1140 gctctcctgcgcaagactac agcaagtatg atgttttact gctggttgga ttaggaattg 1200 gcgcaacgccttttatcagc atattgaaag accttatcaa cagcatcatc aaaatggagg 1260 aagaggaagaagcttcaggt gatctttacc cgccaatcgg acgcaataaa gcacatgttg 1320 atcttgacacccttatgagg attacctcaa aaccaaagag ggttttgaag acaacaaatg 1380 cttatttttattgggtgaca cgtgaacaag gctcttttga ttggtttaaa ggagtcatga 1440 atgagattgctgaactagat caaaggaata ttattgagat gcacaactac ctcacaagtg 1500 tttacgaggaaggggatgct cggtcagcac tcattactat gctgcaagct ctaaaccatg 1560 ccaagaatggtgttgatata gtatctggaa ctaaagtccg tacacatttt gcaaggccaa 1620 attttaagaaggttctttct aagatagcct ccaaacatcc ttatgctaaa ataggagtat 1680 tctactgtggggctccagtt ctggctcagg aattaagcga tctttgccat gattttaatg 1740 gcagatgcacgtcaaaattt gagtttcaca aggagcattt ctaaaatcag gtaaatgcca 1800 aatagggaaataatagaaga ttgtgctact tgtatgacac cttttctttt acacgatgcc 1860 ttgttcacaatggaattaat agcaagggcg agaagcatag aggtgctatc acaacgcacc 1920 ctcctggctacggccagttt gcactgtaca cttcttgctt gagatactca gaagatacaa 1980 taagagtgccaagataagag ttttggggat catctggtac aaaatatata ggacatcaat 2040 catggggaaacataggttcc aacttccaag tgcatcggcg aggtgattgc tcgtaacatg 2100 ttgatctcctggatttggat ttggttttcg actataatct ggtgtaggac atagaaggct 2160 accatgagcacaaggctaat taattgtata ttgcgtgcag ctgagtggtc aggctacaag 2220 atttgctctaactttgtaat tatatacaga gatgggaaac aaacaaatat agtgtgattt 2280 ttgtaagtagtgtaggttgg tagtgattgg gtccagattg gtggctcaga ttacattcaa 2340 gcttgtttttttgttgtgcc gggagtgtac aacgtgcatt gttttaggga tggttttaca 2400 cacatacagatgatgtaatc atgtaatcct tgtagcaaag gccttgttct tacgttggaa 2460 aaaaaaaaaaaaaaaaaaaa 2480 <210> SEQ ID NO 54 <211> LENGTH: 593 <212> TYPE: PRT<213> ORGANISM: Oryza sativa <400> SEQUENCE: 54 Leu Ser Gln Asn Leu AlaGly Leu Arg Lys Arg Ser Pro Ile Arg Lys 1 5 10 15 Ile Ser Thr Lys LeuSer Tyr Tyr Leu Glu Asp Asn Trp Lys Arg Leu 20 25 30 Trp Val Leu Ala LeuTrp Ile Gly Ile Met Ala Gly Leu Phe Ile Trp 35 40 45 Lys Phe Ile Gln TyrArg His Arg Tyr Val Phe Asn Val Met Gly Tyr 50 55 60 Cys Val Thr Thr AlaLys Gly Ala Ala Glu Thr Leu Lys Leu Asn Met 65 70 75 80 Ala Ile Ile LeuLeu Pro Val Cys Arg Asn Thr Ile Thr Trp Leu Arg 85 90 95 Asn Thr Arg AlaAla Arg Ala Leu Pro Phe Asp Asp Asn Ile Asn Phe 100 105 110 His Lys ThrIle Ala Ala Ala Ile Val Val Gly Val Ile Leu His Gly 115 120 125 Gly LeuHis Leu Val Cys Asp Phe Pro Arg Leu Ile Gly Ser Ser Glu 130 135 140 GluLys Tyr Ala Pro Leu Gly Lys Tyr Phe Gly Glu Thr Lys Pro Thr 145 150 155160 Tyr Leu Thr Leu Val Lys Gly Val Glu Gly Ile Thr Gly Val Ile Met 165170 175 Leu Val Cys Met Ile Ile Ala Phe Thr Leu Ala Thr Arg Trp Phe Arg180 185 190 Arg Ser Leu Val Lys Leu Pro Lys Pro Phe Asp Lys Leu Thr GlyPhe 195 200 205 Asn Ala Phe Trp Tyr Ser His His Leu Phe Ile Ile Val TyrIle Ser 210 215 220 Leu Val Ile His Gly Glu Trp Leu Tyr Leu Ile Arg IleTrp Tyr Lys 225 230 235 240 Arg Thr Thr Trp Met Tyr Leu Ala Val Pro ValCys Leu Tyr Val Gly 245 250 255 Glu Arg Thr Leu Arg Phe Phe Arg Ser GlySer Tyr Ser Val Arg Leu 260 265 270 Leu Lys Val Ala Ile Tyr Pro Gly AsnVal Leu Thr Leu Gln Met Ser 275 280 285 Lys Pro Pro Thr Phe Arg Tyr LysSer Gly Gln Tyr Met Phe Val Gln 290 295 300 Cys Pro Ala Val Ser Pro PheGlu Trp His Pro Phe Ser Ile Thr Ser 305 310 315 320 Ala Pro Gly Asp AspTyr Leu Ser Ile His Val Arg Gln Leu Gly Asp 325 330 335 Trp Thr Arg GluLeu Lys Arg Val Phe Ser Ala Ala Cys Glu Pro Pro 340 345 350 Val Gly GlyLys Ser Gly Leu Leu Arg Ala Asp Glu Thr Thr Lys Lys 355 360 365 Ala LeuPro Lys Leu Leu Ile Asp Gly Pro Tyr Gly Ser Pro Ala Gln 370 375 380 AspTyr Ser Lys Tyr Asp Val Leu Leu Leu Val Gly Leu Gly Ile Gly 385 390 395400 Ala Thr Pro Phe Ile Ser Ile Leu Lys Asp Leu Ile Asn Ser Ile Ile 405410 415 Lys Met Glu Glu Glu Glu Glu Ala Ser Gly Asp Leu Tyr Pro Pro Ile420 425 430 Gly Arg Asn Lys Ala His Val Asp Leu Asp Thr Leu Met Arg IleThr 435 440 445 Ser Lys Pro Lys Arg Val Leu Lys Thr Thr Asn Ala Tyr PheTyr Trp 450 455 460 Val Thr Arg Glu Gln Gly Ser Phe Asp Trp Phe Lys GlyVal Met Asn 465 470 475 480 Glu Ile Ala Glu Leu Asp Gln Arg Asn Ile IleGlu Met His Asn Tyr 485 490 495 Leu Thr Ser Val Tyr Glu Glu Gly Asp AlaArg Ser Ala Leu Ile Thr 500 505 510 Met Leu Gln Ala Leu Asn His Ala LysAsn Gly Val Asp Ile Val Ser 515 520 525 Gly Thr Lys Val Arg Thr His PheAla Arg Pro Asn Phe Lys Lys Val 530 535 540 Leu Ser Lys Ile Ala Ser LysHis Pro Tyr Ala Lys Ile Gly Val Phe 545 550 555 560 Tyr Cys Gly Ala ProVal Leu Ala Gln Glu Leu Ser Asp Leu Cys His 565 570 575 Asp Phe Asn GlyArg Cys Thr Ser Lys Phe Glu Phe His Lys Glu His 580 585 590 Phe <210>SEQ ID NO 55 <211> LENGTH: 1255 <212> TYPE: DNA <213> ORGANISM: Glycinemax <400> SEQUENCE: 55 gcacgagaga cactgaagac tacaaacgct tatttttactgggtaacaag agaacaaggt 60 tctttcgatt ggtttaaagg ggtcatgaat gaagttgctgaacttgatca aaggggtgtt 120 attgagatgc acaactactt aactagcgta tatgaggaaggcgatgccag atccgccctt 180 attacaatgg tacaagccct caaccatgca aaaaatggagttgatattgt ttctgggact 240 agggtaagcc tttgattttt gccctttact ttaccctttatgcgttttag ttttggaaaa 300 atagaagtat cctaaagagt ttgcttttca gtcttgagtaagttgatttt aaaccttgat 360 ttgcttctct gaatttacta tcaggtgaga actcattttgccaggcctaa ctggaagaag 420 gttttctcaa gaatatgctc taagcactgc aatggacgaataggggtttt ctattgtggc 480 gcaccggttt tggccaaaga attaagcaag ctctgcttcgagttcaacga aaaaggtcaa 540 acaaaatttg agttccacaa ggagcatttc taagggaatttggagggagc ttcatatctg 600 tatctagaat caacacaatt aattttagca actagtcgtagtcaattata agatatggtt 660 caataagcta aaaccttaag tatagaatat gaggcattgtacatcccggc cggccggcaa 720 gatggctggg cattgatcat acatgtggct gcagctcaaaggagatattg cccttacaag 780 gctccagcaa cttcagatac attctgccga accgcatagattattggccg atacatccca 840 tagaaaatat acaacagttt tgaaagtgat aaaaaggaattagttagtgt tctgcctcag 900 gagcaatgca ttggtttgtg gataatttat ttttttaaaaattatcgttt tctaatattt 960 attttaaaaa atatacttga aaatgatttt ttttttttaagtaaatgtca acaaacatat 1020 accaaatttt cgttttcttc gagtggattc cattatctttgtggaatatt gtgggtgaaa 1080 ctattgtttc gaccctgctg cttctgcttg gatttcttctccgggagagg agatctaatg 1140 tctaaacact gtacatgaga tgagactctt aatcgtatgttgactatgtt ctttttctta 1200 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacaaaaaaaaaaaaa aaaaa 1255 <210> SEQ ID NO 56 <211> LENGTH: 143 <212> TYPE:PRT <213> ORGANISM: Glycine max <400> SEQUENCE: 56 Ala Arg Glu Thr LeuLys Thr Thr Asn Ala Tyr Phe Tyr Trp Val Thr 1 5 10 15 Arg Glu Gln GlySer Phe Asp Trp Phe Lys Gly Val Met Asn Glu Val 20 25 30 Ala Glu Leu AspGln Arg Gly Val Ile Glu Met His Asn Tyr Leu Thr 35 40 45 Ser Val Tyr GluGlu Gly Asp Ala Arg Ser Ala Leu Ile Thr Met Val 50 55 60 Gln Ala Leu AsnHis Ala Lys Asn Gly Val Asp Ile Val Ser Gly Thr 65 70 75 80 Arg Val ArgThr His Phe Ala Arg Pro Asn Trp Lys Lys Val Phe Ser 85 90 95 Arg Ile CysSer Lys His Cys Asn Gly Arg Ile Gly Val Phe Tyr Cys 100 105 110 Gly AlaPro Val Leu Ala Lys Glu Leu Ser Lys Leu Cys Phe Glu Phe 115 120 125 AsnGlu Lys Gly Gln Thr Lys Phe Glu Phe His Lys Glu His Phe 130 135 140<210> SEQ ID NO 57 <211> LENGTH: 496 <212> TYPE: DNA <213> ORGANISM: Zeamays <400> SEQUENCE: 57 gtctccttca cgctggccac gcacccgttg cgcaagggggagcccaaggg cgcgggcgcg 60 gccgccggca cgtcgcggct cccggcgcca ctgaaccggctcaccggctt caacgccttc 120 tggtactcgc accacctcct cggcatcgtg tacgcgctcctgctcgcgca cggctacttc 180 ctcttcctcg tccggaggtg gtacgagaag acgacatggatgtacatttc ggtcccgctg 240 ctgctctacg tcggtgaaag gatgcttaga gccttgaggtcaaatgctta taccgtgaaa 300 attcttaagg tgtgtcttct acctgggaat gtgttgaccataacgatgtc aaagccctat 360 ggatttcgtt acagaagtgg acagtacata tttcttcaatgcccaataat ttctccattc 420 gaatggatcc tttctccatc acttcagcac ctggagatgactacctaagt gtcacatccg 480 aacaaacgtg actgga 496 <210> SEQ ID NO 58 <211>LENGTH: 133 <212> TYPE: PRT <213> ORGANISM: Zea mays <220> FEATURE:<221> NAME/KEY: UNSURE <222> LOCATION: (118) <400> SEQUENCE: 58 Arg LeuPro Ala Pro Leu Asn Arg Leu Thr Gly Phe Asn Ala Phe Trp 1 5 10 15 TyrSer His His Leu Leu Gly Ile Val Tyr Ala Leu Leu Leu Ala His 20 25 30 GlyTyr Phe Leu Phe Leu Val Arg Arg Trp Tyr Glu Lys Thr Thr Trp 35 40 45 MetTyr Ile Ser Val Pro Leu Leu Leu Tyr Val Gly Glu Arg Met Leu 50 55 60 ArgAla Leu Arg Ser Asn Ala Tyr Thr Val Lys Ile Leu Lys Val Cys 65 70 75 80Leu Leu Pro Gly Asn Val Leu Thr Ile Thr Met Ser Lys Pro Tyr Gly 85 90 95Phe Arg Tyr Arg Ser Gly Gln Tyr Ile Phe Leu Gln Cys Pro Ile Ile 100 105110 Ser Pro Phe Glu Trp Xaa Pro Phe Ser Ile Thr Ser Ala Pro Gly Asp 115120 125 Asp Tyr Leu Ser Val 130 <210> SEQ ID NO 59 <211> LENGTH: 571<212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221>NAME/KEY: unsure <222> LOCATION: (293) <221> NAME/KEY: unsure <222>LOCATION: (303) <221> NAME/KEY: unsure <222> LOCATION: (308) <221>NAME/KEY: unsure <222> LOCATION: (322) <221> NAME/KEY: unsure <222>LOCATION: (332) <221> NAME/KEY: unsure <222> LOCATION: (350) <221>NAME/KEY: unsure <222> LOCATION: (353) <221> NAME/KEY: unsure <222>LOCATION: (356) <221> NAME/KEY: unsure <222> LOCATION: (369) <221>NAME/KEY: unsure <222> LOCATION: (378) <221> NAME/KEY: unsure <222>LOCATION: (385) <221> NAME/KEY: unsure <222> LOCATION: (389) <221>NAME/KEY: unsure <222> LOCATION: (403) <221> NAME/KEY: unsure <222>LOCATION: (526) <400> SEQUENCE: 59 ccaaatatat gatctattcc aagaggcagtgttatcacga tcacaagggt gtccaaaact 60 gtacatagat ggcccttatg gttctgctgctcaagaccat gtaaagtatg acattctagt 120 actaattggt cttggcatag gagccacacctttcattagc atccttaaag atgtagttaa 180 aggtgtccag acaacgcaaa atgatcatagtggtctcaga caatgcagct taacaaaagg 240 tccattaaaa gcatatcttt attgggttacaagagagcca aactcttttg gancgggttt 300 aanatgtnat gaagggaatc cncaatttcaanccaaaaag cattcggttn tgnagntgca 360 catttcccng acaagtgnac atccngaangggggatattc gtncagcttt gctaaagggt 420 aatccgggcc ttgcaatctt ggccaagaatgggcctgaca tagtttccaa gggcttccca 480 tacacacaca tttttgctcg gaccaaattgggtccaacat atttcnagga ttggctcgca 540 agcatgggag gagctaagat tggggggttt c571 <210> SEQ ID NO 60 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM:Glycine max <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (98)<221> NAME/KEY: UNSURE <222> LOCATION: (101) <221> NAME/KEY: UNSURE<222> LOCATION: (103) <221> NAME/KEY: UNSURE <222> LOCATION: (111) <221>NAME/KEY: UNSURE <222> LOCATION: (117)..(118)..(119) <221> NAME/KEY:UNSURE <222> LOCATION: (123) <221> NAME/KEY: UNSURE <222> LOCATION:(126) <221> NAME/KEY: UNSURE <222> LOCATION: (130) <400> SEQUENCE: 60Gln Ile Tyr Asp Leu Phe Gln Glu Ala Val Leu Ser Arg Ser Gln Gly 1 5 1015 Cys Pro Lys Leu Tyr Ile Asp Gly Pro Tyr Gly Ser Ala Ala Gln Asp 20 2530 His Val Lys Tyr Asp Ile Leu Val Leu Ile Gly Leu Gly Ile Gly Ala 35 4045 Thr Pro Phe Ile Ser Ile Leu Lys Asp Val Val Lys Gly Val Gln Thr 50 5560 Thr Gln Asn Asp His Ser Gly Leu Arg Gln Cys Ser Leu Thr Lys Gly 65 7075 80 Pro Leu Lys Ala Tyr Leu Tyr Trp Val Thr Arg Glu Pro Asn Ser Phe 8590 95 Gly Xaa Gly Leu Xaa Cys Xaa Glu Gly Asn Pro Gln Phe Gln Xaa Lys100 105 110 Lys His Ser Val Xaa Xaa Xaa His Ile Ser Xaa Thr Ser Xaa HisPro 115 120 125 Glu Xaa Gly Ile Phe Val Gln Leu Cys 130 135 <210> SEQ IDNO 61 <211> LENGTH: 493 <212> TYPE: DNA <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (466)<221> NAME/KEY: unsure <222> LOCATION: (491) <400> SEQUENCE: 61ccacctcctc ggcttcgtct acctcctcct cctcgcccac ggctacttcc tcttcctcgt 60ccgccgctgg tacgagaaaa cgacatggat gtacatttct gtccctctgg tgctctatgt 120cggcgaaagg atgctgcgag ccttgcggtc gaatgctcac cctgtcaaaa tcctcaaggt 180gttgcttcta cctggaagtg tactgacaat acaaatgtca aagccctacg gatttcgata 240taggagtgga caatatatct ttcttcagtg tccgatgatc tctccatttg aatggcatcc 300tttctccatc acctcagctc ctggagatga ctacctcgct gttcacattc gcacaaacgg 360agactggacg caagagctca agcgcatatt tgtcgagaac tacttcacgc cgcacatgaa 420cagaagaact caatcagcga gttaggcgcg gcagaaccta gaccantccc gcacaaagtt 480gctcgtagat ngg 493 <210> SEQ ID NO 62 <211> LENGTH: 129 <212> TYPE: PRT<213> ORGANISM: Triticum aestivum <400> SEQUENCE: 62 His Leu Leu Gly PheVal Tyr Leu Leu Leu Leu Ala His Gly Tyr Phe 1 5 10 15 Leu Phe Leu ValArg Arg Trp Tyr Glu Lys Thr Thr Trp Met Tyr Ile 20 25 30 Ser Val Pro LeuVal Leu Tyr Val Gly Glu Arg Met Leu Arg Ala Leu 35 40 45 Arg Ser Asn AlaHis Pro Val Lys Ile Leu Lys Val Leu Leu Leu Pro 50 55 60 Gly Ser Val LeuThr Ile Gln Met Ser Lys Pro Tyr Gly Phe Arg Tyr 65 70 75 80 Arg Ser GlyGln Tyr Ile Phe Leu Gln Cys Pro Met Ile Ser Pro Phe 85 90 95 Glu Trp HisPro Phe Ser Ile Thr Ser Ala Pro Gly Asp Asp Tyr Leu 100 105 110 Ala ValHis Ile Arg Thr Asn Gly Asp Trp Thr Gln Glu Leu Lys Arg 115 120 125 Ile<210> SEQ ID NO 63 <211> LENGTH: 1703 <212> TYPE: DNA <213> ORGANISM:Zea mays <400> SEQUENCE: 63 gcacgaggtc tccttcacgc tggccacgca cccgttgcgcaagggggagc ccaagggcgc 60 gggcgcggcc gccggcacgt cgcggctccc ggcgccactgaaccggctca ccggcttcaa 120 cgccttctgg tactcgcacc acctcctcgg catcgtgtacgcgctcctgc tcgcgcacgg 180 ctacttcctc ttcctcgtcc ggaggtggta cgagaagacgacatggatgt acatttcggt 240 cccgctgctg ctctacgtcg gtgaaaggat gcttagagccttgaggtcaa atgcttatac 300 cgtgaaaatt cttaaggtgt gtcttctacc tgggaatgtgttgaccataa cgatgtcaaa 360 gccctatgga tttcgttaca gaagtggaca gtacatatttcttcaatgcc caataatttc 420 tccattcgaa tggcatcctt tctccatcac ttcagcacctggagatgact acctaagtgt 480 ccacatccga acaaacggtg actggacaca ggagctcaagcgcatatttg tggagaacta 540 cttctcgccg catctcaaca gaagagcttc gtttagcgagctaggtgcgg cagaaccaag 600 aagcttgcca aaattactcg tagatggtcc ctatggtgcccctgcgcagg attttagaaa 660 ctacgacgtt ctacttctcg tcggcctcgg aatcggggcaacaccgttca taagcattct 720 acgggatctg cttaataaca ttaagatagc tgacgagctgatggacttgg caatggagac 780 cagcaggtct gaagacagcg ccaacagctt tagcgtctcaacggcgagta gcaaccggaa 840 gagagcatac agaacaagcc gtgcacattt ctactgggtcacccgagaag ccggatcctt 900 tgaatggttc aaaggggtga tgaatgaggt tgcagaaatggacaagaagg gtatcataga 960 gctgcacaat tacctcacta gcgtttacga ggaacgcgacgcacggacaa ctctgctgtc 1020 catggtccag gctctgaacc acgccaagca cggcgtcgatatcgtatcgg ggaccagggt 1080 gaggacgcat ttcgccagac ccaactggaa aggagtcttcaacaagattg cctccaagca 1140 tccgaattca acagttggtg tgttctactg cggcgcaccgacgcttgcca aggagctgaa 1200 ggctctggcg cacgagatga gccacaggac gggcactcgcttccatttcc acaaggagta 1260 cttctgagtt tcgacggata gaatgggcag tcgatgctcgttctaacgtg atctggtctt 1320 ttctgatctg atctcgatag cgttaggtac cggtaaagaagacagaattt tgcacagatg 1380 caagacaaac caagctgacc tctcctcctg atctggaactgtacatagca taaaagaagc 1440 accacagggt tagcttagta tacgtagcac aattcaaataaatatattga caaagaaaag 1500 agagacagag aaagggcagg atagaggctg ttgttgttaactcgaagata ttgatgatag 1560 gcatggcggt atacgtgtat acattatagt taggtgctaggtggtagtta tatatatact 1620 gtatatgata tgggtcgatg aatatacgag gagccacatgctgctgccaa aaaaaaaaaa 1680 aaaaaaaaaa aaaaaaaaaa aaa 1703 <210> SEQ IDNO 64 <211> LENGTH: 421 <212> TYPE: PRT <213> ORGANISM: Zea mays <400>SEQUENCE: 64 His Glu Val Ser Phe Thr Leu Ala Thr His Pro Leu Arg Lys GlyGlu 1 5 10 15 Pro Lys Gly Ala Gly Ala Ala Ala Gly Thr Ser Arg Leu ProAla Pro 20 25 30 Leu Asn Arg Leu Thr Gly Phe Asn Ala Phe Trp Tyr Ser HisHis Leu 35 40 45 Leu Gly Ile Val Tyr Ala Leu Leu Leu Ala His Gly Tyr PheLeu Phe 50 55 60 Leu Val Arg Arg Trp Tyr Glu Lys Thr Thr Trp Met Tyr IleSer Val 65 70 75 80 Pro Leu Leu Leu Tyr Val Gly Glu Arg Met Leu Arg AlaLeu Arg Ser 85 90 95 Asn Ala Tyr Thr Val Lys Ile Leu Lys Val Cys Leu LeuPro Gly Asn 100 105 110 Val Leu Thr Ile Thr Met Ser Lys Pro Tyr Gly PheArg Tyr Arg Ser 115 120 125 Gly Gln Tyr Ile Phe Leu Gln Cys Pro Ile IleSer Pro Phe Glu Trp 130 135 140 His Pro Phe Ser Ile Thr Ser Ala Pro GlyAsp Asp Tyr Leu Ser Val 145 150 155 160 His Ile Arg Thr Asn Gly Asp TrpThr Gln Glu Leu Lys Arg Ile Phe 165 170 175 Val Glu Asn Tyr Phe Ser ProHis Leu Asn Arg Arg Ala Ser Phe Ser 180 185 190 Glu Leu Gly Ala Ala GluPro Arg Ser Leu Pro Lys Leu Leu Val Asp 195 200 205 Gly Pro Tyr Gly AlaPro Ala Gln Asp Phe Arg Asn Tyr Asp Val Leu 210 215 220 Leu Leu Val GlyLeu Gly Ile Gly Ala Thr Pro Phe Ile Ser Ile Leu 225 230 235 240 Arg AspLeu Leu Asn Asn Ile Lys Ile Ala Asp Glu Leu Met Asp Leu 245 250 255 AlaMet Glu Thr Ser Arg Ser Glu Asp Ser Ala Asn Ser Phe Ser Val 260 265 270Ser Thr Ala Ser Ser Asn Arg Lys Arg Ala Tyr Arg Thr Ser Arg Ala 275 280285 His Phe Tyr Trp Val Thr Arg Glu Ala Gly Ser Phe Glu Trp Phe Lys 290295 300 Gly Val Met Asn Glu Val Ala Glu Met Asp Lys Lys Gly Ile Ile Glu305 310 315 320 Leu His Asn Tyr Leu Thr Ser Val Tyr Glu Glu Arg Asp AlaArg Thr 325 330 335 Thr Leu Leu Ser Met Val Gln Ala Leu Asn His Ala LysHis Gly Val 340 345 350 Asp Ile Val Ser Gly Thr Arg Val Arg Thr His PheAla Arg Pro Asn 355 360 365 Trp Lys Gly Val Phe Asn Lys Ile Ala Ser LysHis Pro Asn Ser Thr 370 375 380 Val Gly Val Phe Tyr Cys Gly Ala Pro ThrLeu Ala Lys Glu Leu Lys 385 390 395 400 Ala Leu Ala His Glu Met Ser HisArg Thr Gly Thr Arg Phe His Phe 405 410 415 His Lys Glu Tyr Phe 420<210> SEQ ID NO 65 <211> LENGTH: 2312 <212> TYPE: DNA <213> ORGANISM:Triticum aestivum <400> SEQUENCE: 65 gcacgagcca cctcctcggc ttcgtctacctcctcctcct cgcccacggc tacttcctct 60 tcctcgtccg ccgctggtac gagaaaacgacatggatgta catttctgtc cctctggtgc 120 tctatgtcgg cgaaaggatg ctgcgagccttgcggtcgaa tgctcaccct gtcaaaatcc 180 tcaaggtgtt gcttctacct ggaagtgtactgacaataca aatgtcaaag ccctacggat 240 ttcgatatag gagtggacaa tatatctttcttcagtgtcc gatgatctct ccatttgaat 300 ggcatccttt ctccatcacc tcagctcctggagatgacta cctcgctgtt cacattcgca 360 caaacggaga ctggacgcaa gagctcaagcgcatatttgt cgagaactac ttcacgccgc 420 acatgaacag aagaacttca ttcagcgagttaggcgcggc agaacctaga cccactcccg 480 caccaaagtt gctcgtagat ggtccatatggtgcccctgc acaggatttc agaaactacg 540 atgttctgct tctagttggc cttggaattggagcaacacg ttcataagca ttctgaagga 600 cctacttaac aatattaagc tagctgatgagcttatggac ttggcaatgg agactactca 660 aactagtagg tctgaggaca gtgccaacagcttcagtgtc tcaactgcta gcagcaacag 720 gaagagatca tatagaacaa gccgtgcacatttttactgg gttactcgcg agcccatgtc 780 atttgaatgg ttcaaaggag tgatgaatgaggttgccgaa atggacaaga agggtgtcat 840 agagttgcac aattatctta cgagcgtgtacgaggagcgt gatgcacgaa caactctgtt 900 gtcgatggtc caagctctaa accatgccaaacatggtgtc gacatcgtct cgggcaccag 960 ggtgaggaca cactttgcca ggccaaactggagggaagtc ttcaccaaaa tcgccgccaa 1020 gcagccgaat tcaacagttg gagtgttctactgtggcgct ccgacgctgg ccaaagaact 1080 gaagaacctg tcgcacgaga tgagccacaagacgtcgacg cgcttccatt tccacaagga 1140 gtacttctga tgggaagaag aattgattcttctgatctgc aaaatagcgt taggcaccgg 1200 taaagaagaa ggaatttttt gcacagaagcaaagcaaact gacctctctc tcttcctgat 1260 ctggaactgt acatagtata aagaagaacagcctgttagg ggagggtagt tagtgccatt 1320 tatatattat taacaaaaaa gaaagtaaagcaaggctgct atatccaaaa ggagaaggaa 1380 agaggcaggg taaagagttg ggttaaatttcagatatact aatatacatg tacacagtat 1440 agttaggtag ttacgtatga aatgctagctaaagcatatt tacaagggcc ggaggctgcc 1500 aaaaaaaaaa aaaaaaaaaa aaaaaacggcactgtcaact cccggtggga acacctcggt 1560 cgactccatc aacgccatct tgccggtcatgggttacgcc cacaacactg tcaacaacaa 1620 tgttggctct gcctccatca tcggcggctacgtctaccgc tccatgacta acccatgcct 1680 caacggcagg tacatttacg cggacttgtatgcacagtcc atgtggtcag ggatcgagac 1740 gccggagaac agtggggtgt acaatgtgacgccgctgaca ttcggctgct ccaagacgtc 1800 accaatcctg tgtgatgtcg cggccaagagcccgctgccg tcattgggct acatcttctc 1860 ctttggcgaa gacaacgcca aggacttatacttgcttacc agcaagggag tgtacagggt 1920 ggtcgaccct agcagttgca actacgcatgcccaatcaag agctccacgc aggaaggggt 1980 gcccccgcct acctcgtcgc caagctcggcattcaatgcg caaacttcca ccgtaccaat 2040 gacgttgttg gcaggagtgt tgcttgtcttgctgagcttg ggcttttgag aactattcat 2100 tttaagttca gaatgttttg caaagttgccatatggttat ggtgttttga tgttgtatat 2160 tttggatact ctctctatgt gtgcaagcgtgtaaattttt agattgtatt tatcaaatta 2220 aaagtcaaat atgcgcgaaa tggattgaatctgggatttt cccaattgcg caaaaattgc 2280 gtcttcacta ttttaaaaaa aaaaaaaaaaaa 2312 <210> SEQ ID NO 66 <211> LENGTH: 382 <212> TYPE: PRT <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 66 Thr Ser His Leu Leu GlyPhe Val Tyr Leu Leu Leu Leu Ala His Gly 1 5 10 15 Tyr Phe Leu Phe LeuVal Arg Arg Trp Tyr Glu Lys Thr Thr Trp Met 20 25 30 Tyr Ile Ser Val ProLeu Val Leu Tyr Val Gly Glu Arg Met Leu Arg 35 40 45 Ala Leu Arg Ser AsnAla His Pro Val Lys Ile Leu Lys Val Leu Leu 50 55 60 Leu Pro Gly Ser ValLeu Thr Ile Gln Met Ser Lys Pro Tyr Gly Phe 65 70 75 80 Arg Tyr Arg SerGly Gln Tyr Ile Phe Leu Gln Cys Pro Met Ile Ser 85 90 95 Pro Phe Glu TrpHis Pro Phe Ser Ile Thr Ser Ala Pro Gly Asp Asp 100 105 110 Tyr Leu AlaVal His Ile Arg Thr Asn Gly Asp Trp Thr Gln Glu Leu 115 120 125 Lys ArgIle Phe Val Glu Asn Tyr Phe Thr Pro His Met Asn Arg Arg 130 135 140 ThrSer Phe Ser Glu Leu Gly Ala Ala Glu Pro Arg Pro Thr Pro Ala 145 150 155160 Pro Lys Leu Leu Val Asp Gly Pro Tyr Gly Ala Pro Ala Gln Asp Phe 165170 175 Arg Asn Tyr Asp Val Leu Leu Leu Val Gly Leu Gly Ile Gly Ala Thr180 185 190 Thr Phe Ile Ser Ile Leu Lys Asp Leu Leu Asn Asn Ile Lys LeuAla 195 200 205 Asp Glu Leu Met Asp Leu Ala Met Glu Thr Thr Gln Thr SerArg Ser 210 215 220 Glu Asp Ser Ala Asn Ser Phe Ser Val Ser Thr Ala SerSer Asn Arg 225 230 235 240 Lys Arg Ser Tyr Arg Thr Ser Arg Ala His PheTyr Trp Val Thr Arg 245 250 255 Glu Pro Met Ser Phe Glu Trp Phe Lys GlyVal Met Asn Glu Val Ala 260 265 270 Glu Met Asp Lys Lys Gly Val Ile GluLeu His Asn Tyr Leu Thr Ser 275 280 285 Val Tyr Glu Glu Arg Asp Ala ArgThr Thr Leu Leu Ser Met Val Gln 290 295 300 Ala Leu Asn His Ala Lys HisGly Val Asp Ile Val Ser Gly Thr Arg 305 310 315 320 Val Arg Thr His PheAla Arg Pro Asn Trp Arg Glu Val Phe Thr Lys 325 330 335 Ile Ala Ala LysGln Pro Asn Ser Thr Val Gly Val Phe Tyr Cys Gly 340 345 350 Ala Pro ThrLeu Ala Lys Glu Leu Lys Asn Leu Ser His Glu Met Ser 355 360 365 His LysThr Ser Thr Arg Phe His Phe His Lys Glu Tyr Phe 370 375 380 <210> SEQ IDNO 67 <211> LENGTH: 550 <212> TYPE: DNA <213> ORGANISM: Zea mays <400>SEQUENCE: 67 aagcgccgaa ccgcgttcca ggtgatgggc tactgcgtct gcgtcgccaagggtgccgct 60 gagatcctca agctcaacat ggctctcatc ctgctgcccg tctgccggaatacgctgacg 120 acgctcaggt ccaccgcgct cagccatgtc atacccttcg atgacaacatcaacttccac 180 aaggtcatcg cgctgtccat cgcgatcgcc acagcgatcc acacgctcgcacacgtgacc 240 tgcgacttcc caaggctgat cagctgcccg acggacaagt tcatggccaccttggggtcc 300 aacttccact acaagcagcc gacttacctg ggcttgctgg agagcacacccggggttacc 360 ggaatcctca tgatcatcat aatgtccttc tccttcacgc tggcaacacattccttcagg 420 cggagtgtgg tgaagctgcc atcgccgcta caccaccttg ccggtttcaatgccttctgg 480 tacgctcaca ctgctggtcc ttgcgttatg ttctgctggt ggtgcactcctacttcatat 540 tctcacagga 550 <210> SEQ ID NO 68 <211> LENGTH: 181 <212>TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 68 Lys Arg Arg ThrAla Phe Gln Val Met Gly Tyr Cys Val Cys Val Ala 1 5 10 15 Lys Gly AlaAla Glu Ile Leu Lys Leu Asn Met Ala Leu Ile Leu Leu 20 25 30 Pro Val CysArg Asn Thr Leu Thr Thr Leu Arg Ser Thr Ala Leu Ser 35 40 45 His Val IlePro Phe Asp Asp Asn Ile Asn Phe His Lys Val Ile Ala 50 55 60 Leu Ser IleAla Ile Ala Thr Ala Ile His Thr Leu Ala His Val Thr 65 70 75 80 Cys AspPhe Pro Arg Leu Ile Ser Cys Pro Thr Asp Lys Phe Met Ala 85 90 95 Thr LeuGly Ser Asn Phe His Tyr Lys Gln Pro Thr Tyr Leu Gly Leu 100 105 110 LeuGlu Ser Thr Pro Gly Val Thr Gly Ile Leu Met Ile Ile Ile Met 115 120 125Ser Phe Ser Phe Thr Leu Ala Thr His Ser Phe Arg Arg Ser Val Val 130 135140 Lys Leu Pro Ser Pro Leu His His Leu Ala Gly Phe Gln Cys Leu Leu 145150 155 160 Val Arg Ser His Cys Trp Ser Leu Arg Tyr Val Leu Leu Val ValHis 165 170 175 Ser Tyr Phe Ile Phe 180 <210> SEQ ID NO 69 <211> LENGTH:525 <212> TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 69tttagatctt tcatctttgt atatatactc tttcttcttc ttgcttctgt tttttttttt 60tgttagctta aagtgtgttg catctgaatt tttatttatt ccgaaggtat gaaggattcg 120aaggaattcg ctctggaact gtttgatgct ctgagtcgta aacgaaggtt gagaactgac 180aaaatcagca gggaagaact gttcgaattc tggtcgcaaa ttaccgatca aagttttgat 240tcgcggctcc agatcttctt cgacatgtaa ttattcggtc attattgttg ttatcgatgc 300tcaataattt aattaatgat aagtaatctt cctaagttta ctttttgttc tttaattaag 360tctgcgacac ctaatgacaa attaaacaca gcattgtcaa atgtcaccgt ttatttgctt 420taatatttaa atgtccgtta aagcgttgat gttttctttg cttgcaagtt gtgatgcaat 480tatgatcatt acgtgctttt gtatttaaaa taaatcataa tcaac 525 <210> SEQ ID NO 70<211> LENGTH: 54 <212> TYPE: PRT <213> ORGANISM: Glycine max <400>SEQUENCE: 70 Gly Met Lys Asp Ser Lys Glu Phe Ala Leu Glu Leu Phe Asp AlaLeu 1 5 10 15 Ser Arg Lys Arg Arg Leu Arg Thr Asp Lys Ile Ser Arg GluGlu Leu 20 25 30 Phe Glu Phe Trp Ser Gln Ile Thr Asp Gln Ser Phe Asp SerArg Leu 35 40 45 Gln Ile Phe Phe Asp Met 50 <210> SEQ ID NO 71 <211>LENGTH: 1840 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 71ccacgcgtcc gaagcgccga accgcgttcc aggtgatggg ctactgcgtc tgcgtcgcca 60agggtgccgc tgagatcctc aagctcaaca tggctctcat cctgctgccc gtctgccgga 120atacgctgac gacgctcagg tccaccgcgc tcagccatgt catacccttc gatgacaaca 180tcaacttcca caaggtcatc gcgctgtcca tcgcgatcgc cacagcgatc cacacgctcg 240cacacgtgac ctgcgacttc ccaaggctga tcagctgccc gacggacaag ttcatggcca 300ccttggggtc caacttccac tacaagcagc cgacttacct gggcttgctg gagagcacac 360ccggggttac cggaatcctc atgatcatca taatgtcctt ctccttcacg ctggcaacac 420attccttcag gcggagtgtg gtgaagctgc catcgccgct acaccacctt gccggtttca 480atgccttctg gtacgctcac cacctgctgg tccttgcgta tgtcctgctg gtggtgcact 540cctacttcat attcctcacc agggagtggt acaagaagac gacatggatg tacctgattg 600tccctgtcct cttctatgcc tgtgaaagag tcatcaggaa atttcgtgag aacaactacc 660atgcgggaat tgtgagggca gcaatttatc cgggagatgt gctctctatt cacatgaaga 720agccacaggg tttcaagtac aagagtggga tgtatctgtt tgttaaatgc ccagaagtct 780cgcccttcga gtggcacccc ttctctataa cttcggcacc aggcgatgac tacttgagtg 840tgcatatccg tacgctgggt gactggacat ccgaactgcg gatgcttttt gggaaggctt 900gccaggcaca agtaacttcc aagaaggcta cccttacaag acttgaaact acagttgtgg 960cagacgccca gacagaggac actaggtttc ccaaggtcta catagacggg ccatacggtg 1020caccagcaca aaattacagg aaatatgaca ttcttctgct tattggcctt ggaataggag 1080caactccttt catcagcata ctgaaggata tgttgaacaa cctaaaatcc aacgaagagg 1140tgggaagcat ccacggctct gagataggca gcttcaagaa caatggtcca ggaagggctt 1200acttctactg ggtcaccaga gagcaaggat ctttcgaatg gttcaaagga gtcatgaatg 1260aggttgcagg gagcgatcac agcaatgtta tagagatgca caattacctg accagcgtgt 1320atgaagaagg tgacgcaagg tcagctctga ttgccatggt acagtcactt cagcgtgcta 1380aaaacggcgt ggatatcgtc tccggcagca agattcgaac acattttgca agaccaaact 1440ggaggaaggt attctgtgat ttggccagcg cacacaagaa ctctcgcata ggagttttct 1500attgtggatc tccgacgctc acaaaacaac tgaaggatct ttcgaaagaa ttcagccaga 1560caaccacaac ccggttccat ttccacaagg aaaacttctg agacgacgtg tacccgaaga 1620tccgatggac tggaaacata attgtatagg gaaaaaatac gatagcattg gcatagcaga 1680tttagtttta caagttttga tgtatgcggg attgtacaaa atatgtgtag aaagctagat 1740gtcaccatca tacatagatt ctgaaatgct tgcagatata tatattgcat tgcataagtg 1800aaaccacttg cttcctaaaa aaaaaaaaaa aaaaaaaaag 1840 <210> SEQ ID NO 72<211> LENGTH: 532 <212> TYPE: PRT <213> ORGANISM: Zea mays <400>SEQUENCE: 72 Thr Arg Pro Lys Arg Arg Thr Ala Phe Gln Val Met Gly Tyr CysVal 1 5 10 15 Cys Val Ala Lys Gly Ala Ala Glu Ile Leu Lys Leu Asn MetAla Leu 20 25 30 Ile Leu Leu Pro Val Cys Arg Asn Thr Leu Thr Thr Leu ArgSer Thr 35 40 45 Ala Leu Ser His Val Ile Pro Phe Asp Asp Asn Ile Asn PheHis Lys 50 55 60 Val Ile Ala Leu Ser Ile Ala Ile Ala Thr Ala Ile His ThrLeu Ala 65 70 75 80 His Val Thr Cys Asp Phe Pro Arg Leu Ile Ser Cys ProThr Asp Lys 85 90 95 Phe Met Ala Thr Leu Gly Ser Asn Phe His Tyr Lys GlnPro Thr Tyr 100 105 110 Leu Gly Leu Leu Glu Ser Thr Pro Gly Val Thr GlyIle Leu Met Ile 115 120 125 Ile Ile Met Ser Phe Ser Phe Thr Leu Ala ThrHis Ser Phe Arg Arg 130 135 140 Ser Val Val Lys Leu Pro Ser Pro Leu HisHis Leu Ala Gly Phe Asn 145 150 155 160 Ala Phe Trp Tyr Ala His His LeuLeu Val Leu Ala Tyr Val Leu Leu 165 170 175 Val Val His Ser Tyr Phe IlePhe Leu Thr Arg Glu Trp Tyr Lys Lys 180 185 190 Thr Thr Trp Met Tyr LeuIle Val Pro Val Leu Phe Tyr Ala Cys Glu 195 200 205 Arg Val Ile Arg LysPhe Arg Glu Asn Asn Tyr His Ala Gly Ile Val 210 215 220 Arg Ala Ala IleTyr Pro Gly Asp Val Leu Ser Ile His Met Lys Lys 225 230 235 240 Pro GlnGly Phe Lys Tyr Lys Ser Gly Met Tyr Leu Phe Val Lys Cys 245 250 255 ProGlu Val Ser Pro Phe Glu Trp His Pro Phe Ser Ile Thr Ser Ala 260 265 270Pro Gly Asp Asp Tyr Leu Ser Val His Ile Arg Thr Leu Gly Asp Trp 275 280285 Thr Ser Glu Leu Arg Met Leu Phe Gly Lys Ala Cys Gln Ala Gln Val 290295 300 Thr Ser Lys Lys Ala Thr Leu Thr Arg Leu Glu Thr Thr Val Val Ala305 310 315 320 Asp Ala Gln Thr Glu Asp Thr Arg Phe Pro Lys Val Tyr IleAsp Gly 325 330 335 Pro Tyr Gly Ala Pro Ala Gln Asn Tyr Arg Lys Tyr AspIle Leu Leu 340 345 350 Leu Ile Gly Leu Gly Ile Gly Ala Thr Pro Phe IleSer Ile Leu Lys 355 360 365 Asp Met Leu Asn Asn Leu Lys Ser Asn Glu GluVal Gly Ser Ile His 370 375 380 Gly Ser Glu Ile Gly Ser Phe Lys Asn AsnGly Pro Gly Arg Ala Tyr 385 390 395 400 Phe Tyr Trp Val Thr Arg Glu GlnGly Ser Phe Glu Trp Phe Lys Gly 405 410 415 Val Met Asn Glu Val Ala GlySer Asp His Ser Asn Val Ile Glu Met 420 425 430 His Asn Tyr Leu Thr SerVal Tyr Glu Glu Gly Asp Ala Arg Ser Ala 435 440 445 Leu Ile Ala Met ValGln Ser Leu Gln Arg Ala Lys Asn Gly Val Asp 450 455 460 Ile Val Ser GlySer Lys Ile Arg Thr His Phe Ala Arg Pro Asn Trp 465 470 475 480 Arg LysVal Phe Cys Asp Leu Ala Ser Ala His Lys Asn Ser Arg Ile 485 490 495 GlyVal Phe Tyr Cys Gly Ser Pro Thr Leu Thr Lys Gln Leu Lys Asp 500 505 510Leu Ser Lys Glu Phe Ser Gln Thr Thr Thr Thr Arg Phe His Phe His 515 520525 Lys Glu Asn Phe 530 <210> SEQ ID NO 73 <211> LENGTH: 1528 <212>TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 73 tttagatctttcatctttgt atatatactc tttcttcttc ttgcttctgt tttttttttt 60 tgttagcttaaagtgtgttg catctgaatt tttatttatt ccgaaggtat gaaggattcg 120 aaggaattcgctctggaact gtttgatgct ctgagtcgta aacgaaggtt gagaactgac 180 aaaatcagcagggaagaact gttcgaattc tggtcgcaaa ttaccgatca aagttttgat 240 tcgcggctccagatcttctt cgacatcgct tctgcaaata ggttgtccag attgcaggaa 300 caggctgaagaatatgcagc tctaatcatg gaagagttgg accccgaagg acttggctac 360 atcgaggtgggaaatctttt tatattccat aatatattct aattaaattt atcaacacat 420 gcatacacatagtgaaaata atgtgttcat gttatttctt ttcataaaga aaacatacag 480 acgattaatgagtagcatgt gtattgcgtg gtgaacagga ataatccatt actttagcat 540 tattgttttatataatttcc acaagtctac taaaataaaa taaaaacata agtttttctt 600 ttccttgttatataaggttt tcttggcaca ttacgggaac aattgctccc gcaattccaa 660 acataaaagaatttacttgt ttttatttta ttttttataa atctttaaaa atatgcaata 720 tagaaataataaggtatttt tttgtaatta tcaatacaaa aagaaaatta tatatcgaat 780 ttaacagcacttaattttta caggcaaatt tgtttaacaa atactgacga gtttcttaaa 840 aaaatattaaataaaaaatt aaatttatta attagttaaa aatggtgtgt ttatttttta 900 atttttaacaagtacattta aacatgaatt agttgatgat gcttgtttgt tagttagctt 960 agtgtacattaatttcctag cttagtcagt gttgggctag ggccgcactt atctaattag 1020 tcaacaatggaccaaattgt gaattgaaca agtcacccga aatttcctag tgtagttcag 1080 ccgttttcaagggtaaaaag aaaaagaaag acaaattagg catattataa cttataagta 1140 aaaatttgcttgctgaattt aaaaaattga tttgatatac ttacaacatc atttcattaa 1200 ttacgtaaaaaaatattatt agtaatagac taatagggtg cactcattaa catgcaatct 1260 tatttcttcattcttttgcc gtgtcaacaa atttaaattc agaccacttt atccctagct 1320 aggtacgtaatataataatt aagctgttat ttctaataac tcagttcatg cagtgcagcg 1380 gccgcggtgctggttaatat aaattattac agtgacctca ttatttcttt gatcaataag 1440 acatttttttccagaagata ttaattataa ttttgagtag tagtggtttt tgtctttggg 1500 catgtcttcagagtggatgg agagagat 1528 <210> SEQ ID NO 74 <211> LENGTH: 104 <212>TYPE: PRT <213> ORGANISM: Glycine max <400> SEQUENCE: 74 Ile Phe Ile TyrSer Glu Gly Met Lys Asp Ser Lys Glu Phe Ala Leu 1 5 10 15 Glu Leu PheAsp Ala Leu Ser Arg Lys Arg Arg Leu Arg Thr Asp Lys 20 25 30 Ile Ser ArgGlu Glu Leu Phe Glu Phe Trp Ser Gln Ile Thr Asp Gln 35 40 45 Ser Phe AspSer Arg Leu Gln Ile Phe Phe Asp Ile Ala Ser Ala Asn 50 55 60 Arg Leu SerArg Leu Gln Glu Gln Ala Glu Glu Tyr Ala Ala Leu Ile 65 70 75 80 Met GluGlu Leu Asp Pro Glu Gly Leu Gly Tyr Ile Glu Val Gly Asn 85 90 95 Leu PheIle Phe His Asn Ile Phe 100 <210> SEQ ID NO 75 <211> LENGTH: 486 <212>TYPE: DNA <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure<222> LOCATION: (456) <221> NAME/KEY: unsure <222> LOCATION: (465) <400>SEQUENCE: 75 accaaatccc gatgttctgt gcaacacgag aataaagttc gggtctttcttagaagccat 60 tgaaaattta ggatttgact acatagcttc aggacactat gcacacgtagtacatccttc 120 cgctgaaaat accgaagcgc catccgtact acaattatca aaggacaagatcaaggatca 180 aacctatttc ctctcacatc tatctcaatc tcagcttaga aggcttcttttcccactagg 240 atgtattaca aaggatgagg ttcgcagact ggctactcaa atgggccttcctaaccaagg 300 taggaaagac tcacaaggga tatgctttct tggaaaggta tacactgtacctttttttct 360 tagggatcaa tgtttttaac aaatccattg gggagctata caagtatcctctttctaggg 420 tcaagtttaa gtgaagtttg ttgaaaagac atatanggag aaaanggaagggaatcatac 480 tgggaa 486 <210> SEQ ID NO 76 <211> LENGTH: 36 <212>TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 76 Pro Asn Pro AspVal Leu Cys Asn Thr Arg Ile Lys Phe Gly Ser Phe 1 5 10 15 Leu Glu AlaIle Glu Asn Leu Gly Phe Asp Tyr Ile Ala Ser Gly His 20 25 30 Tyr Ala HisVal 35 <210> SEQ ID NO 77 <211> LENGTH: 321 <212> TYPE: DNA <213>ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (94) <221> NAME/KEY: unsure <222> LOCATION: (99) <221>NAME/KEY: unsure <222> LOCATION: (158) <221> NAME/KEY: unsure <222>LOCATION: (170) <221> NAME/KEY: unsure <222> LOCATION: (226) <221>NAME/KEY: unsure <222> LOCATION: (241) <221> NAME/KEY: unsure <222>LOCATION: (270)..(271) <221> NAME/KEY: unsure <222> LOCATION: (273)<221> NAME/KEY: unsure <222> LOCATION: (280)..(281) <221> NAME/KEY:unsure <222> LOCATION: (294)..(295)..(296) <221> NAME/KEY: unsure <222>LOCATION: (313) <221> NAME/KEY: unsure <222> LOCATION: (315) <400>SEQUENCE: 77 cgcacacctc gcctcctctc tccggcgacc tggaccgcta cctccgctgctccatgccgc 60 agaactcgcc gctccgagtc gcggttcttg tcancggang cgtcgatagcagcgtcgctc 120 tccggctgct ccacgccgct ggccactcct gcaccgcntt ctacctcaanatatggttcc 180 aagaagattt cgagaacttt tggtctgagt gcccctggga agacgntttgaagtatgctc 240 nagatgtttg taatcaggtt gatgtaccgn nanaagttgn ncatttaaccgatnnntact 300 ggaacaatgt ggntncttac c 321 <210> SEQ ID NO 78 <211>LENGTH: 83 <212> TYPE: PRT <213> ORGANISM: Glycine max <220> FEATURE:<221> NAME/KEY: UNSURE <222> LOCATION: (8) <221> NAME/KEY: UNSURE <222>LOCATION: (10) <221> NAME/KEY: UNSURE <222> LOCATION: (33) <221>NAME/KEY: UNSURE <222> LOCATION: (52) <221> NAME/KEY: UNSURE <222>LOCATION: (57) <221> NAME/KEY: UNSURE <222> LOCATION: (67)..(68) <221>NAME/KEY: UNSURE <222> LOCATION: (70) <221> NAME/KEY: UNSURE <222>LOCATION: (75) <221> NAME/KEY: UNSURE <222> LOCATION: (81)..(82) <400>SEQUENCE: 78 Leu Arg Val Ala Val Leu Val Xaa Gly Xaa Val Asp Ser Ser ValAla 1 5 10 15 Leu Arg Leu Leu His Ala Ala Gly His Ser Cys Thr Ala PheTyr Leu 20 25 30 Xaa Ile Trp Phe Gln Glu Asp Phe Glu Asn Phe Trp Ser GluCys Pro 35 40 45 Trp Glu Asp Xaa Leu Lys Tyr Ala Xaa Asp Val Cys Asn GlnVal Asp 50 55 60 Val Pro Xaa Xaa Val Xaa His Leu Thr Asp Xaa Tyr Trp AsnAsn Val 65 70 75 80 Xaa Xaa Tyr <210> SEQ ID NO 79 <211> LENGTH: 1412<212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 79 gcacgagaccaaatcccgat gttctgtgca acacgagaat aaagttcggg tctttcttag 60 aagccattgaaaatttagga tttgactaca tagcttcagg acactatgca cacgtagtac 120 atccttccgctgaaaatacc gaagcgccat ccgtactaca attatcaaag gacaagatca 180 aggatcaaacctatttcctc tcacatctat ctcaatctca gcttagaagg cttcttttcc 240 cactaggatgtattacaaag gatgaggttc gcagactggc tactcaaatg ggccttccta 300 accaaggtaggaaagactca caagggatat gctttcttgg aaagtttagt gagtttgttg 360 aaagacatataggagaaaag gaaggaatca tactggaagc tgagtccgga gattacctag 420 ggatacaccgtgggttttgg ttttatacaa ttggtcagcg gcaaggatta cggcttcctg 480 gaggaccttggtacgttgta gagaaagatg ttcaaaacaa tgtggttttt gtatcaagga 540 attattattcattggataag cggaggcgca catttcgtgt tggatcacta aactggtttg 600 ataattgtggacctggaaac aatgagcaac tcaaatgcaa ggtgcgacat agccctgaat 660 ttcatgattgcactttggta aaagagcgta gtgaagaaaa tggggatgct ttggtggtgc 720 atctggctgaagatgaccag ggcttggcag ctggtcagtt tgcagcgttc tatagtgaaa 780 atgtatgcctggggtcaggc ataattttgg attcatggga caagatgagc ttccctgtct 840 gctcaagggcactcgaaatc gcaaagttgg cggacaagtc gagcttaggg aagccaataa 900 gaatcgtgaacctggaacat atcgtgaaac ctgaacaaca ggaggaaatc aaagttgcct 960 gacagattggtgccgtaacc ttctcctagc tccgccagct gcccccttca aatcaattca 1020 tcaatcaaccgtatcgcagg ccctggaaat cgaaggatga ggggtcttct agaatctacc 1080 gatatatacccccagttgga aactccaggt ccttcttgga aggatgatat agtggaatat 1140 ggcatgacttggtgcgctcg tgttgcagag gaacgatgaa gcttcgtgcg tggatagggt 1200 tgatctgaattaactgttga cagtgatttt acgatctcgc gggaaagcaa cacaatagtt 1260 tgggcacgtcctctcctgag tggaatgatc tggattcagt agtagtcagt ggcacttcat 1320 taaaactaatacagggaaat cgtgacgttg tatgatagtg gtaagatatt gtattgtgaa 1380 ttttgacgataatatcggtg cgtgattgtg cc 1412 <210> SEQ ID NO 80 <211> LENGTH: 319 <212>TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 80 Thr Arg Pro AsnPro Asp Val Leu Cys Asn Thr Arg Ile Lys Phe Gly 1 5 10 15 Ser Phe LeuGlu Ala Ile Glu Asn Leu Gly Phe Asp Tyr Ile Ala Ser 20 25 30 Gly His TyrAla His Val Val His Pro Ser Ala Glu Asn Thr Glu Ala 35 40 45 Pro Ser ValLeu Gln Leu Ser Lys Asp Lys Ile Lys Asp Gln Thr Tyr 50 55 60 Phe Leu SerHis Leu Ser Gln Ser Gln Leu Arg Arg Leu Leu Phe Pro 65 70 75 80 Leu GlyCys Ile Thr Lys Asp Glu Val Arg Arg Leu Ala Thr Gln Met 85 90 95 Gly LeuPro Asn Gln Gly Arg Lys Asp Ser Gln Gly Ile Cys Phe Leu 100 105 110 GlyLys Phe Ser Glu Phe Val Glu Arg His Ile Gly Glu Lys Glu Gly 115 120 125Ile Ile Leu Glu Ala Glu Ser Gly Asp Tyr Leu Gly Ile His Arg Gly 130 135140 Phe Trp Phe Tyr Thr Ile Gly Gln Arg Gln Gly Leu Arg Leu Pro Gly 145150 155 160 Gly Pro Trp Tyr Val Val Glu Lys Asp Val Gln Asn Asn Val ValPhe 165 170 175 Val Ser Arg Asn Tyr Tyr Ser Leu Asp Lys Arg Arg Arg ThrPhe Arg 180 185 190 Val Gly Ser Leu Asn Trp Phe Asp Asn Cys Gly Pro GlyAsn Asn Glu 195 200 205 Gln Leu Lys Cys Lys Val Arg His Ser Pro Glu PheHis Asp Cys Thr 210 215 220 Leu Val Lys Glu Arg Ser Glu Glu Asn Gly AspAla Leu Val Val His 225 230 235 240 Leu Ala Glu Asp Asp Gln Gly Leu AlaAla Gly Gln Phe Ala Ala Phe 245 250 255 Tyr Ser Glu Asn Val Cys Leu GlySer Gly Ile Ile Leu Asp Ser Trp 260 265 270 Asp Lys Met Ser Phe Pro ValCys Ser Arg Ala Leu Glu Ile Ala Lys 275 280 285 Leu Ala Asp Lys Ser SerLeu Gly Lys Pro Ile Arg Ile Val Asn Leu 290 295 300 Glu His Ile Val LysPro Glu Gln Gln Glu Glu Ile Lys Val Ala 305 310 315 <210> SEQ ID NO 81<211> LENGTH: 1667 <212> TYPE: DNA <213> ORGANISM: Glycine max <400>SEQUENCE: 81 gcacgagcgc acacctcgcc tcctctctcc ggcgacctgg accgctacctccgctgctcc 60 atgccgcaga actcgccgct ccgagtcgcg gttcttgtca gcggaggcgtcgatagcagc 120 gtcgctctcc ggctgctcca cgccgctggc cactcctgca ccgctttctacctcaagata 180 tggttccaag aagatttcga gaacttttgg tctgagtgcc cctgggaagacgatttgaag 240 tatgctaaag atgtttgtaa tcaggttgat gtaccgttag aagttgttcatttaaccgat 300 gaatactgga acaatgtggt ttcttacctc attgaagagt atagtagtggccgaacccct 360 aaccctgatg ttctgtgcaa tacaagaatt aagtttggtg cattcttggatgcgattggt 420 ggcatgggtt ttgactatgt tgcctctggg cattatgcaa atgttatccacccatgttct 480 gatcggaggg atgaaccttc tgttctggaa ctatcacaag acatggtaaaggatcaaaca 540 tacttccttt cacacctctc acaatcccag ctgaagcaac ttctttttccgcttggttgt 600 attcccaagg atgaagtccg caagcttgcc acaaaatttg atctaccaaataaggataga 660 aaggattccc aaggaatatg ctttttgggc aagataagat tcagtgaatttgttgcaaga 720 catattgggg agagagaagg tgtcatacta gaagctgaga caggagatttccttggcaaa 780 catcgaggat tttggtttta tactattggt cagcgccagg gtctacggctacctggaggc 840 ccatggtatg ttgttgagaa ggatgttaaa aacaatgtag tttttgtgtcaagaaattac 900 ttttcctttg acaaaagaag gcgtgtattc cgtgttggct cttttaaatggcttagcggg 960 ttgcccccag gccagacaac tcagctccaa tgcaaggtga gacatggtcctggattttat 1020 gattgtagct tacaaatgga agtcgaagct gatggccaat gtcactctgctgttgtccgc 1080 atatctgaag atgatcaagg cctagcagct gggcagtttg cggcattctatgagggaaga 1140 acatgcatag gttctggtgt gattttggag ttctgggatg atcagagttttccagtttgt 1200 acaaaagctt tagaaattgc tagaatggaa gataaatcaa agattgggaatccggtaaaa 1260 ataaaagtta aaccagataa cccgcaagaa gtgtgtgatt ctgcagagttagcaagtaaa 1320 acatcacaag ggaataagaa attaaagagt tgctggcact gaacaaaacagactgacgtg 1380 taagaggaat gcagcttcca ttttcccttt caattggcca caacagagcattggcaaatg 1440 ggggtataca tgcagatttt ctgatccttc gttctgtaac ttttttatatacatgtttag 1500 ggaaacaaga agtattttat agagagaagg atagcgattt aaggtaggtgagaaattagt 1560 agcttctatt attcggtgtt tccaattaaa atgtgtatat actttgtcatggtttgcttg 1620 atactttatt cagtttattg acaaatgtaa aaaaaaaaaa aaaaaaa 1667<210> SEQ ID NO 82 <211> LENGTH: 361 <212> TYPE: PRT <213> ORGANISM:Glycine max <400> SEQUENCE: 82 Arg Val Ala Val Leu Val Ser Gly Gly ValAsp Ser Ser Val Ala Leu 1 5 10 15 Arg Leu Leu His Ala Ala Gly His SerCys Thr Ala Phe Tyr Leu Lys 20 25 30 Ile Trp Phe Gln Glu Asp Phe Glu AsnPhe Trp Ser Glu Cys Pro Trp 35 40 45 Glu Asp Asp Leu Lys Tyr Ala Lys AspVal Cys Asn Gln Val Asp Val 50 55 60 Pro Leu Glu Val Val His Leu Thr AspGlu Tyr Trp Asn Asn Val Val 65 70 75 80 Ser Tyr Leu Ile Glu Glu Tyr SerSer Gly Arg Thr Pro Asn Pro Asp 85 90 95 Val Leu Cys Asn Thr Arg Ile LysPhe Gly Ala Phe Leu Asp Ala Ile 100 105 110 Gly Gly Met Gly Phe Asp TyrVal Ala Ser Gly His Tyr Ala Asn Val 115 120 125 Ile His Pro Cys Ser AspArg Arg Asp Glu Pro Ser Val Leu Glu Leu 130 135 140 Ser Gln Asp Met ValLys Asp Gln Thr Tyr Phe Leu Ser His Leu Ser 145 150 155 160 Gln Ser GlnLeu Lys Gln Leu Leu Phe Pro Leu Gly Cys Ile Pro Lys 165 170 175 Asp GluVal Arg Lys Leu Ala Thr Lys Phe Asp Leu Pro Asn Lys Asp 180 185 190 ArgLys Asp Ser Gln Gly Ile Cys Phe Leu Gly Lys Ile Arg Phe Ser 195 200 205Glu Phe Val Ala Arg His Ile Gly Glu Arg Glu Gly Val Ile Leu Glu 210 215220 Ala Glu Thr Gly Asp Phe Leu Gly Lys His Arg Gly Phe Trp Phe Tyr 225230 235 240 Thr Ile Gly Gln Arg Gln Gly Leu Arg Leu Pro Gly Gly Pro TrpTyr 245 250 255 Val Val Glu Lys Asp Val Lys Asn Asn Val Val Phe Val SerArg Asn 260 265 270 Tyr Phe Ser Phe Asp Lys Arg Arg Arg Val Phe Arg ValGly Ser Phe 275 280 285 Lys Trp Leu Ser Gly Leu Pro Pro Gly Gln Thr ThrGln Leu Gln Cys 290 295 300 Lys Val Arg His Gly Pro Gly Phe Tyr Asp CysSer Leu Gln Met Glu 305 310 315 320 Val Glu Ala Asp Gly Gln Cys His SerAla Val Val Arg Ile Ser Glu 325 330 335 Asp Asp Gln Gly Leu Ala Ala GlyGln Phe Ala Ala Phe Tyr Glu Gly 340 345 350 Arg Thr Cys Ile Gly Ser GlyVal Ile 355 360 <210> SEQ ID NO 83 <211> LENGTH: 555 <212> TYPE: DNA<213> ORGANISM: Helianthus tuberosus <220> FEATURE: <221> NAME/KEY:unsure <222> LOCATION: (41) <221> NAME/KEY: unsure <222> LOCATION: (394)<221> NAME/KEY: unsure <222> LOCATION: (448) <221> NAME/KEY: unsure<222> LOCATION: (462) <221> NAME/KEY: unsure <222> LOCATION: (476) <221>NAME/KEY: unsure <222> LOCATION: (510) <221> NAME/KEY: unsure <222>LOCATION: (519)..(520) <221> NAME/KEY: unsure <222> LOCATION: (543)<400> SEQUENCE: 83 tcatcaccaa aatggcgaaa ggagtgaaga gaaacacgaancaggaagcc attgattcat 60 ccacacattc ttccgataca aaaccatcgc ctccgttaaagaaatgcaaa acatctccgt 120 cggttgctgt ttcggccgat gaagaggctc ggttcgttggaaaaccggtt ccggctgagc 180 aagcgagggg gaaatggcct catcgatatg aatcgaagaataaagtgaag gttatttcgt 240 catctgatgg tgaggagaag gagatttttc aagctaagtgccattatacc aaagctactg 300 ttgatggtat ttcttttgat ctttatgatg atgcttatgtcaaggctgaa gaagggaacc 360 agattacatt gctcggatcg tggagatgtt tganaccgttgataaagatt atatttctcc 420 gctcagtggt tttcagagct gaagatangt tattaaagccangccacttt cgacanagac 480 atgtctatcc gaatgaatat gacaatcacn gatagttgnncaaataagtt gtcactcccc 540 aangttgact tgtta 555 <210> SEQ ID NO 84 <211>LENGTH: 765 <212> TYPE: PRT <213> ORGANISM: Helianthus tuberosus <400>SEQUENCE: 84 Phe Val Gly Lys Pro Val Pro Ala Glu Gln Ala Arg Gly Lys TrpPro 1 5 10 15 His Arg Tyr Glu Ser Lys Asn Lys Val Lys Val Ile Ser SerSer Asp 20 25 30 Gly Glu Glu Lys Glu Ile Phe Gln Ala Lys Cys His Tyr ThrLys Ala 35 40 45 Thr Val Asp Gly Ile Ser Phe Asp Leu Tyr Asp Asp Ala TyrVal Lys 50 55 60 Ala Glu Glu Gly Lys Pro Asp Tyr Ile Ala Arg Ile Val GluMet Phe 65 70 75 80 Glu Thr Val Asp Lys Glu Leu Tyr Phe Ser Ala Gln TrpPhe Phe Arg 85 90 95 Ala Glu Asp Thr Val Ile Lys Ser Gln Ala His Leu IleAsp Lys Arg 100 105 110 Arg Val Phe Tyr Ser Glu Met Lys Asp Asp Asn ProLeu Asp Ser Ile 115 120 125 Val Ser Lys Phe Lys Ile Val Gln Leu Pro ProAsn Val Asp Leu Val 130 135 140 Glu Lys Glu Lys Ala Leu Ser Ser Tyr AspTyr Tyr Tyr Asp Met Gln 145 150 155 160 Tyr Ser Lys Pro Val Thr Phe ThrThr Leu His Lys Glu Asn Leu Thr 165 170 175 Thr Glu Ser Gly Glu Ser SerVal Val Ser Asp Asp Ala Cys Ser Asn 180 185 190 Gly Val Val Glu Ser AsnAsn Lys Asn Ala Lys Pro Thr Lys Ile Asn 195 200 205 Asp Ser Glu Lys SerGlu Met Thr Leu Leu Asp Leu Tyr Ser Gly Cys 210 215 220 Gly Ala Met SerThr Gly Leu Cys Tyr Gly Thr Asn Met Ala Gly Val 225 230 235 240 Lys LeuVal Thr Lys Trp Ala Val Asp Ile Asn Glu His Ala Cys Glu 245 250 255 SerLeu Lys Leu Asn His Ala Glu Thr Gln Val Arg Asn Glu Ala Ala 260 265 270Asp Asp Phe Leu Leu Leu Leu Lys Glu Trp Glu Lys Leu Cys Lys Gln 275 280285 Phe Gly Leu Leu Gly Ser Lys Arg Asp Asp Thr Asn Val Lys Ser Glu 290295 300 Glu Ser Asp Ser Glu Glu Ile Asp Glu Ser Arg Asp Pro Tyr Lys Gly305 310 315 320 Glu Phe Glu Val Gln Arg Leu Thr Ala Val Cys Tyr Gly AspPro Asn 325 330 335 Lys Ala Asn Lys Gln Lys Leu His Phe Lys Val Arg TrpLys Gly Phe 340 345 350 Gly Pro Ser Tyr Asp Thr Trp Glu Pro Val Asp GlyLeu Ser Asn Cys 355 360 365 Glu Glu Ser Ile Lys Asp Phe Val Ile Lys GlyTyr Lys Ser Arg Met 370 375 380 Leu Pro Leu Pro Gly Asp Val Asp Phe IleCys Gly Gly Pro Pro Cys 385 390 395 400 Gln Gly Ile Ser Gly His Asn ArgPhe Arg Asn Tyr Thr Asp Pro Leu 405 410 415 Lys Asp Pro Lys Asn His GlnLeu Val Val Tyr Met Asp Ile Ile Asp 420 425 430 Phe Leu Lys Pro Lys PheVal Leu Met Glu Asn Val Cys Asp Leu Val 435 440 445 Lys Phe Ala Asp GlyIle Leu Gly Tyr His Ala Val Gly Arg Leu Val 450 455 460 Ser Met Asn TyrGln Thr Arg Met Gly Ile Met Ala Ala Gly Ser Tyr 465 470 475 480 Gly ValPro Gln Cys Arg Leu Arg Val Phe Leu Trp Gly Ala Asn Thr 485 490 495 MetMet Asn Leu Pro Gln Phe Pro Leu Pro Thr His Glu Val Val Gly 500 505 510Arg Gly Val Val Pro Val Glu Phe Lys Asp Cys Ile Val Gly Ser Asp 515 520525 Asn Asp Asn Ser Tyr Lys Leu Glu Lys Ser Ile Arg Leu Gly Asp Ala 530535 540 Ile Ser Asp Leu Pro Glu Val Thr Asn Asn Lys Gly Lys Asp Glu Met545 550 555 560 Glu Tyr Ala Gly Ala Pro Gln Thr Ser Phe Gln Lys Tyr IleArg Leu 565 570 575 Arg Lys Gln Ala Leu Gly Lys Asp Ser Ser Lys Arg MetMet Leu Tyr 580 585 590 Asp His Arg Pro Leu Glu Leu Asn Glu Asp Asp TyrAla Arg Val Cys 595 600 605 His Ile Pro Lys Ile Lys Gly Ala Asn Phe ArgAsp Leu Pro Gly Val 610 615 620 Arg Val Gly Lys Gly Asn Lys Val Glu LeuAsp Pro Asp Val Glu Arg 625 630 635 640 Val Leu Leu Pro Ser Gly Lys ProLeu Val Pro Asn Tyr Ala Ile Thr 645 650 655 Phe Val Arg Gly Thr Ser LysLys Pro Phe Gly Arg Leu Gly Met Asp 660 665 670 Asp Ile Val Thr Thr ValVal Gly Arg Ala Glu Pro His Asn Gln Ala 675 680 685 Leu Leu His Pro AsnGln Asp Arg Val Leu Thr Ile Arg Glu Asn Ala 690 695 700 Arg Leu Gln GlyPhe Pro Asp His Tyr Lys Leu Cys Gly Pro Val Lys 705 710 715 720 Ala ArgTyr Met Gln Ile Gly Asn Ala Val Ser Phe Ser Val Ser Thr 725 730 735 GlyLeu Gly Tyr Ser Leu Ala Lys Ala Ile Gln Gly Val Cys Thr Ser 740 745 750Lys Pro Ile Lys Leu Pro His Lys Phe Pro Asp Cys Leu 755 760 765 <210>SEQ ID NO 85 <211> LENGTH: 605 <212> TYPE: DNA <213> ORGANISM: Zea mays<220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (307) <221>NAME/KEY: unsure <222> LOCATION: (325)..(326)..(327) <221> NAME/KEY:unsure <222> LOCATION: (329) <221> NAME/KEY: unsure <222> LOCATION:(357) <221> NAME/KEY: unsure <222> LOCATION: (364) <221> NAME/KEY:unsure <222> LOCATION: (382) <221> NAME/KEY: unsure <222> LOCATION:(390) <221> NAME/KEY: unsure <222> LOCATION: (393) <221> NAME/KEY:unsure <222> LOCATION: (489) <221> NAME/KEY: unsure <222> LOCATION:(534) <221> NAME/KEY: unsure <222> LOCATION: (537) <221> NAME/KEY:unsure <222> LOCATION: (569) <221> NAME/KEY: unsure <222> LOCATION:(575) <221> NAME/KEY: unsure <222> LOCATION: (596) <400> SEQUENCE: 85gggagtatgg tggttccccc aagacagagt tccagcgcta cattcgactt ggtcgtaaag 60acatgttgga ttggtcgttt ggtgaggagg ctggtccaga tgaaggcaag ctcttggatc 120accagccctt acggcttaac aatgatgatt atgagcgggt taagcaaatt cctgtcaaga 180agggagccaa cttccgtgac ctaaagggtg tcaaggttgg agcaaataat gttgttgagt 240gggatccaga agtcgaacgt gtgtaccttt cgtctgggaa accactggtt cctgactatg 300cgatgtnatt catcaagggc aaatnnntna agccattcgg gcgcctgtgg tgggacnaga 360cggntcctac agtttgtgac cngagcagan ctnataacca ggttatattg catccgactt 420aagcaagagt cttgactatc cgggagaacg ccaagggtac aggggctttc ccgattacta 480cccgattgnt ttggaccgat caaggagaag tatattcaag tccggaacgc caanggnagt 540ccctggttgc acgggcactg ggctactgnc tgggncaagc ctacctgggg aatctnacgg 600gatca 605 <210> SEQ ID NO 86 <211> LENGTH: 184 <212> TYPE: PRT <213>ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION:(98) <221> NAME/KEY: UNSURE <222> LOCATION: (104)..(105) <221> NAME/KEY:UNSURE <222> LOCATION: (115) <221> NAME/KEY: UNSURE <222> LOCATION:(117) <221> NAME/KEY: UNSURE <222> LOCATION: (123) <221> NAME/KEY:UNSURE <222> LOCATION: (126)..(127) <221> NAME/KEY: UNSURE <222>LOCATION: (136) <221> NAME/KEY: UNSURE <222> LOCATION: (159) <221>NAME/KEY: UNSURE <222> LOCATION: (174)..(175) <400> SEQUENCE: 86 Ser ProLys Thr Glu Phe Gln Arg Tyr Ile Arg Leu Gly Arg Lys Asp 1 5 10 15 MetLeu Asp Trp Ser Phe Gly Glu Glu Ala Gly Pro Asp Glu Gly Lys 20 25 30 LeuLeu Asp His Gln Pro Leu Arg Leu Asn Asn Asp Asp Tyr Glu Arg 35 40 45 ValLys Gln Ile Pro Val Lys Lys Gly Ala Asn Phe Arg Asp Leu Lys 50 55 60 GlyVal Lys Val Gly Ala Asn Asn Val Val Glu Trp Asp Pro Glu Val 65 70 75 80Glu Arg Val Tyr Leu Ser Ser Gly Lys Pro Leu Val Pro Asp Tyr Ala 85 90 95Met Xaa Phe Ile Lys Gly Lys Xaa Xaa Lys Pro Phe Gly Arg Leu Trp 100 105110 Trp Asp Xaa Thr Xaa Pro Thr Val Cys Asp Xaa Ser Arg Xaa Xaa Asn 115120 125 Gln Val Ile Leu His Pro Thr Xaa Ala Arg Val Leu Thr Ile Arg Glu130 135 140 Asn Ala Lys Gly Thr Gly Ala Phe Pro Ile Thr Thr Arg Leu XaaTrp 145 150 155 160 Thr Asp Gln Gly Glu Val Tyr Ser Ser Pro Glu Arg GlnXaa Xaa Ser 165 170 175 Leu Val Ala Arg Ala Leu Gly Tyr 180 <210> SEQ IDNO 87 <211> LENGTH: 433 <212> TYPE: DNA <213> ORGANISM: Oryza sativa<220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (433) <400>SEQUENCE: 87 cttacaattt ttattaggct gtcattctaa ggagaaatta cccccttttccactgcctac 60 gcatgaggcg attgtgaaga atggctgccc gttggctttt gagcgtaatttggttggttg 120 gcccaatgac acaccaatgc aactagcaag accaattgtc cttgaagacattctttctga 180 tctcccagaa gtggcaaatg gggaaagccg tgatgaaatg ctgtatgtaaagggtcctca 240 aactgaattc caaagataca ttcggtcatt taatgttgaa gtgcaaggtcccagagctca 300 atgttacaaa aagattccaa atcctcaaaa ttgttatgat catcgaccattggtactggg 360 ataatgatta ctaccaaaag ggatattgca aattcccaaa gaaaaaagggaagctaattt 420 taagagacct ttn 433 <210> SEQ ID NO 88 <211> LENGTH: 141<212> TYPE: PRT <213> ORGANISM: Oryza sativa <220> FEATURE: <221>NAME/KEY: UNSURE <222> LOCATION: (121)..(122) <400> SEQUENCE: 88 Glu ValAla Asn Gly Glu Ser Arg Asp Glu Met Leu Tyr Val Lys Gly 1 5 10 15 ProGln Thr Glu Phe Gln Arg Tyr Ile Arg Ser Phe Asn Val Glu Val 20 25 30 GlnGly Pro Arg Ala Gln Cys Tyr Lys Lys Ile Pro Asn Pro Gln Asn 35 40 45 CysTyr Asp His Arg Pro Leu Val Leu Gly Phe Leu Leu Gly Cys His 50 55 60 SerLys Glu Lys Leu Pro Pro Phe Pro Leu Pro Thr His Glu Ala Ile 65 70 75 80Val Lys Asn Gly Cys Pro Leu Ala Phe Glu Arg Asn Leu Val Gly Trp 85 90 95Pro Asn Asp Thr Pro Met Gln Leu Ala Arg Pro Ile Val Leu Glu Asp 100 105110 Ile Leu Ser Asp Leu Pro Glu Val Xaa Xaa Leu Leu Pro Lys Gly Ile 115120 125 Leu Gln Ile Pro Lys Glu Lys Arg Glu Ala Asn Phe Lys 130 135 140<210> SEQ ID NO 89 <211> LENGTH: 1042 <212> TYPE: DNA <213> ORGANISM:Triticum aestivum <400> SEQUENCE: 89 gcacgagctg gggtgctctt tcttccatggtgctccctaa gtatcctctg cctacatatg 60 acgttgtagt acgtggagga gcacctaatgccttttcgca atgtatggtt gcatatgatg 120 agacacaaag gccatccctg aagaaagctttgcttcttgg tgatgcattt tcagatttac 180 caaaggtcga aaaccatcaa cctaacgatgtaatggagta tggtggttcc cccaagacag 240 agttccagcg ctacattcga cttggtcgtaaagacatgtt ggattggtcg tttggtgagg 300 aggctggtcc agatgaaggc aagctcttggatcaccagcc cttacggctt aacaatgatg 360 attatgagcg ggttaagcaa attcctgtcaagaagggagc caacttccgt gacctaaagg 420 gtgtcaaggt tggagcaaat aatgttgttgagtgggatcc agaagtcgaa cgtgtgtacc 480 tttcgtctgg gaaaccactg gttcctgactatgcgatgtc attcatcaag ggcaaatcac 540 tcaagccatt cgggcgcctg tggtgggacgagacggttcc tacagttgtg accagagcag 600 agcctcataa ccaggttata ttgcatccgactcaagcaag agtcttgact atccgggaga 660 acgcaaggtt acagggcttc cccgattactaccgattgtt tggaccgatc aaggagaagt 720 atattcaagt cgggaacgca gtggcagtccctgttgcacg ggcactgggc tactgtctgg 780 gtcaagccta cctgggtgaa tctgacgggagtcagcctct gtaccagctg cctgcaagtt 840 ttacctctgt ggggcgaacc gcggttcaggcgaatgccgt ttctgttggc actcctgcgg 900 gggaggtagt cgagcagtaa aaggatagcggagcaaccct ggttggtatt ttgattcgag 960 cccatccagt agcatgttta ccaataaataatcattggtc gtgctgattc ttatggttgg 1020 agatgaatgt atgtagggtg ta 1042<210> SEQ ID NO 90 <211> LENGTH: 305 <212> TYPE: PRT <213> ORGANISM:Triticum aestivum <400> SEQUENCE: 90 Thr Ser Trp Gly Ala Leu Ser Ser MetVal Leu Pro Lys Tyr Pro Leu 1 5 10 15 Pro Thr Tyr Asp Val Val Val ArgGly Gly Ala Pro Asn Ala Phe Ser 20 25 30 Gln Cys Met Val Ala Tyr Asp GluThr Gln Arg Pro Ser Leu Lys Lys 35 40 45 Ala Leu Leu Leu Gly Asp Ala PheSer Asp Leu Pro Lys Val Glu Asn 50 55 60 His Gln Pro Asn Asp Val Met GluTyr Gly Gly Ser Pro Lys Thr Glu 65 70 75 80 Phe Gln Arg Tyr Ile Arg LeuGly Arg Lys Asp Met Leu Asp Trp Ser 85 90 95 Phe Gly Glu Glu Ala Gly ProAsp Glu Gly Lys Leu Leu Asp His Gln 100 105 110 Pro Leu Arg Leu Asn AsnAsp Asp Tyr Glu Arg Val Lys Gln Ile Pro 115 120 125 Val Lys Lys Gly AlaAsn Phe Arg Asp Leu Lys Gly Val Lys Val Gly 130 135 140 Ala Asn Asn ValVal Glu Trp Asp Pro Glu Val Glu Arg Val Tyr Leu 145 150 155 160 Ser SerGly Lys Pro Leu Val Pro Asp Tyr Ala Met Ser Phe Ile Lys 165 170 175 GlyLys Ser Leu Lys Pro Phe Gly Arg Leu Trp Trp Asp Glu Thr Val 180 185 190Pro Thr Val Val Thr Arg Ala Glu Pro His Asn Gln Val Ile Leu His 195 200205 Pro Thr Gln Ala Arg Val Leu Thr Ile Arg Glu Asn Ala Arg Leu Gln 210215 220 Gly Phe Pro Asp Tyr Tyr Arg Leu Phe Gly Pro Ile Lys Glu Lys Tyr225 230 235 240 Ile Gln Val Gly Asn Ala Val Ala Val Pro Val Ala Arg AlaLeu Gly 245 250 255 Tyr Cys Leu Gly Gln Ala Tyr Leu Gly Glu Ser Asp GlySer Gln Pro 260 265 270 Leu Tyr Gln Leu Pro Ala Ser Phe Thr Ser Val GlyArg Thr Ala Val 275 280 285 Gln Ala Asn Ala Val Ser Val Gly Thr Pro AlaGly Glu Val Val Glu 290 295 300 Gln 305 <210> SEQ ID NO 91 <211> LENGTH:681 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 91gcacgaggtt tcgtggaaca gtcaaagaca ggtatcgcca gattgggaat gctgtggccg 60tacccgttgg ccgtgcactt gggtatgcct tggccatggc caatctgaac aagactggaa 120atgatcccct catggcgcta ccacctaagt ttgcattctc tcataatata gaaggtactc 180cttagtcaga ctggcaaaac ctaacggttg acttgtccac ccgggtttaa gtctcgtcat 240ctccgttcat tctctgctag aaaagctggg tcatcgctgc acctcatggt cgtgaatacc 300attaggactc gtgaatatcg ttagggcatg cccattttaa cacaagtctt gattcatttt 360attgccatgt tgtaccttgt aacatcttag tttcttagaa gtttccgtag gcaggtgtaa 420gttgctcagt aaggtttctc ctgagagaac cttgtgttgt attgtaaaag aatgtatgta 480tcataacttt gtcacacaca tttgttagaa gctcacatgg gcaggtgtag gtggctgcag 540tgagttcctc tgcctttgtt gtactataaa agactctatg tatgtatgta tgtatgtata 600gacaagctgg gtggtcaggg gaataaaatg tacaccttct gaaaaaaaaa aaaaaaaaac 660tcgagactag ttctctctct c 681 <210> SEQ ID NO 92 <211> LENGTH: 60 <212>TYPE: PRT <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 92 Thr ArgPhe Arg Gly Thr Val Lys Asp Arg Tyr Arg Gln Ile Gly Asn 1 5 10 15 AlaVal Ala Val Pro Val Gly Arg Ala Leu Gly Tyr Ala Leu Ala Met 20 25 30 AlaAsn Leu Asn Lys Thr Gly Asn Asp Pro Leu Met Ala Leu Pro Pro 35 40 45 LysPhe Ala Phe Ser His Asn Ile Glu Gly Thr Pro 50 55 60 <210> SEQ ID NO 93<211> LENGTH: 2687 <212> TYPE: DNA <213> ORGANISM: Helianthus tuberosus<400> SEQUENCE: 93 gcacgagtca tcaccaaaat ggcgaaagga gtgaagagaaacacgaaaca ggaagccatt 60 gattcatcca cacattcttc cgatacaaaa ccatcgcctccgttaaagaa atgcaaaaca 120 tctccgtcgg ttgctgtttc ggccgatgaa gaggctcggttcgttggaaa accggttccg 180 gctgagcaag cgagggggaa atggcctcat cgatatgaatcgaagaataa agtgaaggtt 240 atttcgtcat ctgatggtga ggagaaggag atttttcaagctaagtgcca ttataccaaa 300 gctactgttg atggtatttc ttttgatctt tatgatgatgcttatgtcaa ggctgaagaa 360 gggaagccag attacattgc tcggatcgtg gagatgtttgaaaccgttga taaagagtta 420 tatttctccg ctcagtggtt tttcagagct gaagatacggttattaaaag ccaagcccac 480 cttatcgaca aaagacgagt gttctattcc gaaatgaaagatgacaatcc actggatagt 540 attgtatcga aattcaagat tgttcaactt cctccaaatgttgacttggt tgagaaagag 600 aaggcactct catcgtatga ttattactat gatatgcagtattcaaagcc ggttacattt 660 acaactctac acaaagaaaa cttaaccaca gagagtggcgagtcatccgt agtttcagat 720 gatgcgtgct caaacggtgt agtagagagt aacaataaaaatgcaaaacc aacaaaaatc 780 aacgacagtg aaaagtctga aatgacccta ttggacttgtattcgggttg tggtgccatg 840 tcaaccgggc tttgttatgg cacaaatatg gctggtgtaaaacttgtgac aaaatgggca 900 gttgacatca atgaacatgc atgtgaaagt ctgaagctgaaccatgctga aactcaggtg 960 agaaatgaag cagctgatga ctttttattg ttgctgaaagaatgggagaa actttgtaaa 1020 caatttggtt tgttgggttc aaaacgcgat gatacaaatgtaaaatctga agaatcagat 1080 tctgaggaga ttgatgagag tcgtgatccg tataagggtgaatttgaagt acaacgattg 1140 acggctgttt gttatggtga tccaaacaaa gccaacaaacaaaagctgca ttttaaggtg 1200 cgatggaagg gctttggccc tagttatgac acatgggagccagttgatgg cttgagtaac 1260 tgtgaggagt ctataaagga ttttgttatc aaaggatacaaatcaagaat gctaccactt 1320 cctggtgatg ttgacttcat atgtggaggt cctccatgtcaaggaatcag tggtcataat 1380 cggtttagaa actatactga tcctttaaag gatccaaagaatcaccagct tgtagtctac 1440 atggatatta ttgatttttt gaaaccaaaa tttgttttgatggagaatgt ttgtgatctt 1500 gtcaaatttg cggatggtat tttgggatat catgctgttgggcgtttagt ttcgatgaat 1560 tatcaaacac gtatggggat aatggcagct ggatcttatggagttcctca atgcagacta 1620 agggtctttc tttggggtgc taatacgatg atgaatttgcctcagtttcc attaccaaca 1680 cacgaggttg ttggaagagg agttgttcct gttgagttcaaggactgtat tgtcgggtct 1740 gataatgata actcgtacaa gttagagaag agcatacgtcttggtgatgc aatttcagac 1800 ttgcctgagg ttacaaacaa caaaggtaag gatgaaatggagtatgcagg tgccccacaa 1860 acaagttttc agaaatacat tagattgaga aagcaagctttggggaaaga ttcttcaaaa 1920 agaatgatgc tttacgatca taggccattg gaactaaacgaagatgatta tgctcgggtt 1980 tgccatattc caaagataaa gggagcaaac tttagggatttaccaggagt aagagttggt 2040 aaagggaaca aagtagaatt ggaccctgac gttgaaagggtcttgttacc ttcaggaaaa 2100 cctttggtcc caaattatgc tataacattt gttcgtggaacttcaaaaaa accctttggt 2160 cgcttgggta tggatgatat tgtcacaact gtagttggtagagctgagcc acataatcag 2220 gcgctgcttc atcctaacca ggatagagtg ttgaccattcgtgaaaatgc acgtttacaa 2280 ggatttcctg atcattataa actttgtgga cctgttaaagcaaggtacat gcaaattgga 2340 aatgcagttt cattctcggt atcaactgga ttggggtacagcttagccaa agcaattcaa 2400 ggagtttgca ccagtaaacc aatcaaactc ccacataagtttccagattg tcttggacaa 2460 ctatcttcac tataaacaag acattcaagg gacaatgatgacaaaattgg tttagtcttt 2520 aatgaaagat tgaggttaat ttatgcattt aacttgcttatttcggcatc aattaattgt 2580 atgattttca tcatattaga gtgtaattca acattgattcttagtaagtt ttgttgatat 2640 gagcaactat gcttatattt tttctagaaa aaaaaaaaaaaaaaaaa 2687 <210> SEQ ID NO 94 <211> LENGTH: 818 <212> TYPE: PRT <213>ORGANISM: Helianthus tuberosus <400> SEQUENCE: 94 Met Ala Lys Gly ValLys Arg Asn Thr Lys Gln Glu Ala Ile Asp Ser 1 5 10 15 Ser Thr His SerSer Asp Thr Lys Pro Ser Pro Pro Leu Lys Lys Cys 20 25 30 Lys Thr Ser ProSer Val Ala Val Ser Ala Asp Glu Glu Ala Arg Phe 35 40 45 Val Gly Lys ProVal Pro Ala Glu Gln Ala Arg Gly Lys Trp Pro His 50 55 60 Arg Tyr Glu SerLys Asn Lys Val Lys Val Ile Ser Ser Ser Asp Gly 65 70 75 80 Glu Glu LysGlu Ile Phe Gln Ala Lys Cys His Tyr Thr Lys Ala Thr 85 90 95 Val Asp GlyIle Ser Phe Asp Leu Tyr Asp Asp Ala Tyr Val Lys Ala 100 105 110 Glu GluGly Lys Pro Asp Tyr Ile Ala Arg Ile Val Glu Met Phe Glu 115 120 125 ThrVal Asp Lys Glu Leu Tyr Phe Ser Ala Gln Trp Phe Phe Arg Ala 130 135 140Glu Asp Thr Val Ile Lys Ser Gln Ala His Leu Ile Asp Lys Arg Arg 145 150155 160 Val Phe Tyr Ser Glu Met Lys Asp Asp Asn Pro Leu Asp Ser Ile Val165 170 175 Ser Lys Phe Lys Ile Val Gln Leu Pro Pro Asn Val Asp Leu ValGlu 180 185 190 Lys Glu Lys Ala Leu Ser Ser Tyr Asp Tyr Tyr Tyr Asp MetGln Tyr 195 200 205 Ser Lys Pro Val Thr Phe Thr Thr Leu His Lys Glu AsnLeu Thr Thr 210 215 220 Glu Ser Gly Glu Ser Ser Val Val Ser Asp Asp AlaCys Ser Asn Gly 225 230 235 240 Val Val Glu Ser Asn Asn Lys Asn Ala LysPro Thr Lys Ile Asn Asp 245 250 255 Ser Glu Lys Ser Glu Met Thr Leu LeuAsp Leu Tyr Ser Gly Cys Gly 260 265 270 Ala Met Ser Thr Gly Leu Cys TyrGly Thr Asn Met Ala Gly Val Lys 275 280 285 Leu Val Thr Lys Trp Ala ValAsp Ile Asn Glu His Ala Cys Glu Ser 290 295 300 Leu Lys Leu Asn His AlaGlu Thr Gln Val Arg Asn Glu Ala Ala Asp 305 310 315 320 Asp Phe Leu LeuLeu Leu Lys Glu Trp Glu Lys Leu Cys Lys Gln Phe 325 330 335 Gly Leu LeuGly Ser Lys Arg Asp Asp Thr Asn Val Lys Ser Glu Glu 340 345 350 Ser AspSer Glu Glu Ile Asp Glu Ser Arg Asp Pro Tyr Lys Gly Glu 355 360 365 PheGlu Val Gln Arg Leu Thr Ala Val Cys Tyr Gly Asp Pro Asn Lys 370 375 380Ala Asn Lys Gln Lys Leu His Phe Lys Val Arg Trp Lys Gly Phe Gly 385 390395 400 Pro Ser Tyr Asp Thr Trp Glu Pro Val Asp Gly Leu Ser Asn Cys Glu405 410 415 Glu Ser Ile Lys Asp Phe Val Ile Lys Gly Tyr Lys Ser Arg MetLeu 420 425 430 Pro Leu Pro Gly Asp Val Asp Phe Ile Cys Gly Gly Pro ProCys Gln 435 440 445 Gly Ile Ser Gly His Asn Arg Phe Arg Asn Tyr Thr AspPro Leu Lys 450 455 460 Asp Pro Lys Asn His Gln Leu Val Val Tyr Met AspIle Ile Asp Phe 465 470 475 480 Leu Lys Pro Lys Phe Val Leu Met Glu AsnVal Cys Asp Leu Val Lys 485 490 495 Phe Ala Asp Gly Ile Leu Gly Tyr HisAla Val Gly Arg Leu Val Ser 500 505 510 Met Asn Tyr Gln Thr Arg Met GlyIle Met Ala Ala Gly Ser Tyr Gly 515 520 525 Val Pro Gln Cys Arg Leu ArgVal Phe Leu Trp Gly Ala Asn Thr Met 530 535 540 Met Asn Leu Pro Gln PhePro Leu Pro Thr His Glu Val Val Gly Arg 545 550 555 560 Gly Val Val ProVal Glu Phe Lys Asp Cys Ile Val Gly Ser Asp Asn 565 570 575 Asp Asn SerTyr Lys Leu Glu Lys Ser Ile Arg Leu Gly Asp Ala Ile 580 585 590 Ser AspLeu Pro Glu Val Thr Asn Asn Lys Gly Lys Asp Glu Met Glu 595 600 605 TyrAla Gly Ala Pro Gln Thr Ser Phe Gln Lys Tyr Ile Arg Leu Arg 610 615 620Lys Gln Ala Leu Gly Lys Asp Ser Ser Lys Arg Met Met Leu Tyr Asp 625 630635 640 His Arg Pro Leu Glu Leu Asn Glu Asp Asp Tyr Ala Arg Val Cys His645 650 655 Ile Pro Lys Ile Lys Gly Ala Asn Phe Arg Asp Leu Pro Gly ValArg 660 665 670 Val Gly Lys Gly Asn Lys Val Glu Leu Asp Pro Asp Val GluArg Val 675 680 685 Leu Leu Pro Ser Gly Lys Pro Leu Val Pro Asn Tyr AlaIle Thr Phe 690 695 700 Val Arg Gly Thr Ser Lys Lys Pro Phe Gly Arg LeuGly Met Asp Asp 705 710 715 720 Ile Val Thr Thr Val Val Gly Arg Ala GluPro His Asn Gln Ala Leu 725 730 735 Leu His Pro Asn Gln Asp Arg Val LeuThr Ile Arg Glu Asn Ala Arg 740 745 750 Leu Gln Gly Phe Pro Asp His TyrLys Leu Cys Gly Pro Val Lys Ala 755 760 765 Arg Tyr Met Gln Ile Gly AsnAla Val Ser Phe Ser Val Ser Thr Gly 770 775 780 Leu Gly Tyr Ser Leu AlaLys Ala Ile Gln Gly Val Cys Thr Ser Lys 785 790 795 800 Pro Ile Lys LeuPro His Lys Phe Pro Asp Cys Leu Gly Gln Leu Ser 805 810 815 Ser Leu<210> SEQ ID NO 95 <211> LENGTH: 1071 <212> TYPE: DNA <213> ORGANISM:Zea mays <400> SEQUENCE: 95 ccacgcgtcc gggagtatgg tggttccccc aagacagagttccagcgcta cattcgactt 60 ggtcgtaaag acatgttgga ttggtcgttt ggtgaggaggctggtccaga tgaaggcaag 120 ctcttggatc accagccctt acggcttaac aatgatgattatgagcgggt taagcaaatt 180 cctgtcaaga agggagccaa cttccgtgac ctaaagggtgtcaaggttgg agcaaataat 240 gttgttgagt gggatccaga agtcgaacgt gtgtacctttcgtctgggaa accactggtt 300 cctgactatg cgatgtcatt catcaagggc aaatcactcaagccattcgg gcgcctgtgg 360 tgggacgaga cggttcctac agttgtgacc agagcagagcctcataacca ggttatattg 420 catccgactc aagcaagagt cttgactatc cgggagaacgcaaggttaca gggcttcccc 480 gattactacc gattgtttgg accgatcaag gagaagtatattcaagtcgg gaacgcagtg 540 gcagtccctg ttgcacgggc actgggctac tgtctgggtcaagcctacct gggtgaatct 600 gacgggagtc agcctctgta ccagctgcct gcaagttttacctctgtggg gcgaaccgcg 660 gttcaggcga atgccgtttc tgttggcact cctgcgggggaggtagtcga gcagtaaaag 720 gatagcggag caaccctggt tggtattttg attcgagcccatccagtagc atgtttacca 780 ataaataatc attggtcgtg ctgattctta tggttggagatgaatgtatg tagggtgtac 840 tcgagctcga gtgcttgttg tactgtaggt tgaggtttctcatccattgg cctgcctatt 900 tgtggatgac gtttcatttc agattagcaa tgtgcttatttaaggtttcg tcatgtacct 960 gtattctaca atccactatt gtttccaaag acagcatttgatccttgttc aacgcgagca 1020 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaaaaaaaaaaaa g 1071 <210> SEQ ID NO 96 <211> LENGTH: 238 <212> TYPE: PRT<213> ORGANISM: Zea mays <400> SEQUENCE: 96 Pro Arg Val Arg Glu Tyr GlyGly Ser Pro Lys Thr Glu Phe Gln Arg 1 5 10 15 Tyr Ile Arg Leu Gly ArgLys Asp Met Leu Asp Trp Ser Phe Gly Glu 20 25 30 Glu Ala Gly Pro Asp GluGly Lys Leu Leu Asp His Gln Pro Leu Arg 35 40 45 Leu Asn Asn Asp Asp TyrGlu Arg Val Lys Gln Ile Pro Val Lys Lys 50 55 60 Gly Ala Asn Phe Arg AspLeu Lys Gly Val Lys Val Gly Ala Asn Asn 65 70 75 80 Val Val Glu Trp AspPro Glu Val Glu Arg Val Tyr Leu Ser Ser Gly 85 90 95 Lys Pro Leu Val ProAsp Tyr Ala Met Ser Phe Ile Lys Gly Lys Ser 100 105 110 Leu Lys Pro PheGly Arg Leu Trp Trp Asp Glu Thr Val Pro Thr Val 115 120 125 Val Thr ArgAla Glu Pro His Asn Gln Val Ile Leu His Pro Thr Gln 130 135 140 Ala ArgVal Leu Thr Ile Arg Glu Asn Ala Arg Leu Gln Gly Phe Pro 145 150 155 160Asp Tyr Tyr Arg Leu Phe Gly Pro Ile Lys Glu Lys Tyr Ile Gln Val 165 170175 Gly Asn Ala Val Ala Val Pro Val Ala Arg Ala Leu Gly Tyr Cys Leu 180185 190 Gly Gln Ala Tyr Leu Gly Glu Ser Asp Gly Ser Gln Pro Leu Tyr Gln195 200 205 Leu Pro Ala Ser Phe Thr Ser Val Gly Arg Thr Ala Val Gln AlaAsn 210 215 220 Ala Val Ser Val Gly Thr Pro Ala Gly Glu Val Val Glu Gln225 230 235 <210> SEQ ID NO 97 <211> LENGTH: 1291 <212> TYPE: DNA <213>ORGANISM: Oryza sativa <400> SEQUENCE: 97 gcacgagctt acaatttttattaggctgtc attctaagga gaaattaccc ccttttccac 60 tgcctacgca tgaggcgattgtgaagaatg gctgcccgtt ggcttttgag cgtaatttgg 120 ttggttggcc caatgacacaccaatgcaac tagcaagacc aattgtcctt gaagacattc 180 tttctgatct cccagaagtggcaaatgggg aaagccgtga tgaaatgctg tatgtaaagg 240 gtcctcaaac tgaattccaaagatacattc ggtcatttaa tgttgaagtg catggtccca 300 gagctcatgt tacaaaagattccaaatcct caaaattgta tgatcatcga ccattggtac 360 tggataatga taactaccaaaggatattgc aaattcccaa gagaaaggga gctaatttta 420 gagacctttc aggagtcatagttggtcctg acaatgtggc tagattagat ccaacgaaag 480 aaagagtact tttgccatctggtcgtcctc tggccttgat acatcctgca caagacagac 540 tgcttactat tcgtgaaagtgctcggttgc aaggctttcc tgacaattac aggtttcgtg 600 gaacagtcaa agacaggtatcgccagattg gaaatgcggt ggccgtacct gttggacgag 660 cacttggata cgccctggcgatggcctacc tgaagaagtc tggagacgat ccccttatgt 720 tgctgccgcc caactttgcattctcccatg acttaagagg gtttgcctag tcaaacaccc 780 ttgtcagaag gcacaaatttgagttccttc cctcgttggt ttgattgaaa ttcaccatct 840 acattaatgc tctgctacaaaagctgctcc tcttcttgtt gccacccttt ctcgtgaata 900 gtgaatagca ttaggagttaggacatgcca tttatctgtt gggatcagat caaggaatct 960 gggttcccca atctgatgatcagctgtttg ttgtagcaaa atcattcctt gatttacagt 1020 agttgtgaca tcccccacactagttaattg gaaggattcc atagggcaag tgtagtgact 1080 catatcggga gcctatgttgtagtgtaaaa gaatgtatgt acaagttaga tgattaggaa 1140 aataaaatgt atggacatcatcttacctgt tgttgtgtta gccagcgatg aacacaaatg 1200 taacgacttc aataaatctacattgagctg cagatttcca atgggcctaa aaaaaaaaaa 1260 aaaaagaaaa aaaaaaaaaaaaaaaaaaaa a 1291 <210> SEQ ID NO 98 <211> LENGTH: 255 <212> TYPE: PRT<213> ORGANISM: Oryza sativa <400> SEQUENCE: 98 Thr Ser Leu Gln Phe LeuLeu Gly Cys His Ser Lys Glu Lys Leu Pro 1 5 10 15 Pro Phe Pro Leu ProThr His Glu Ala Ile Val Lys Asn Gly Cys Pro 20 25 30 Leu Ala Phe Glu ArgAsn Leu Val Gly Trp Pro Asn Asp Thr Pro Met 35 40 45 Gln Leu Ala Arg ProIle Val Leu Glu Asp Ile Leu Ser Asp Leu Pro 50 55 60 Glu Val Ala Asn GlyGlu Ser Arg Asp Glu Met Leu Tyr Val Lys Gly 65 70 75 80 Pro Gln Thr GluPhe Gln Arg Tyr Ile Arg Ser Phe Asn Val Glu Val 85 90 95 His Gly Pro ArgAla His Val Thr Lys Asp Ser Lys Ser Ser Lys Leu 100 105 110 Tyr Asp HisArg Pro Leu Val Leu Asp Asn Asp Asn Tyr Gln Arg Ile 115 120 125 Leu GlnIle Pro Lys Arg Lys Gly Ala Asn Phe Arg Asp Leu Ser Gly 130 135 140 ValIle Val Gly Pro Asp Asn Val Ala Arg Leu Asp Pro Thr Lys Glu 145 150 155160 Arg Val Leu Leu Pro Ser Gly Arg Pro Leu Ala Leu Ile His Pro Ala 165170 175 Gln Asp Arg Leu Leu Thr Ile Arg Glu Ser Ala Arg Leu Gln Gly Phe180 185 190 Pro Asp Asn Tyr Arg Phe Arg Gly Thr Val Lys Asp Arg Tyr ArgGln 195 200 205 Ile Gly Asn Ala Val Ala Val Pro Val Gly Arg Ala Leu GlyTyr Ala 210 215 220 Leu Ala Met Ala Tyr Leu Lys Lys Ser Gly Asp Asp ProLeu Met Leu 225 230 235 240 Leu Pro Pro Asn Phe Ala Phe Ser His Asp LeuArg Gly Phe Ala 245 250 255 <210> SEQ ID NO 99 <211> LENGTH: 1324 <212>TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 99 gcacgaggtcgttgatattt tgaagttttc tggtggttat ctggggcgtt atgccatagg 60 ccgtcttgtagctatgaatt atcaagcaag aatgggcatg atggcagcag ggtcttatgg 120 ccttccacaatttcgtatgc gtgttttcct ttggggagct cgtcctactg agcaattgcc 180 tccatatccgttaccaacgc atgaggtggt gtcaagaggt tttgttccca ctgagtttga 240 agaaattacagtagcttatg ataaaaagga tacctgtcag ctggctggtg ctcttttact 300 tgatgatgcaatatcagatc tcccacctgt tacaaatgat gagaaccagg atgaaaggaa 360 ctatggagctcctgctcgaa ctgagtttca aagatatatt agattgaaga aaaacgagat 420 ggtgggtagcatggctactg ctcaaagcac accacgtaga atactgtatg atcatcgtcc 480 tcttcaattgaacaaagatg attatgacag agtttgccag attccccaga agaagggtgc 540 aaacttcagagatctacctg gagtacttgt gaatggcaac aaagttgaat gggatccatc 600 ggttgaaagagtgatgctag actctggaaa gcctttggtt cctgattatg caatgacatt 660 tgtccgtgggacctctacta agccatttgg tcggttgtgg tgggatgaaa ttgtgccaac 720 agtggtgacaagggccgagc ctcacaacca ggccattctc caccctagac aaaaccgagt 780 acttaccattagagagaatg cgagactaca aggatttcct gattgctata aactttgtgg 840 gcctgtcaaagagaggtaca tacaagttgg aaatgctgtt gctgttcctg ttgctctagc 900 attgggatacacgtttggtt tggcctgcca gggactgtct gacgataagc ctttgacaac 960 cctccccttcaagtatccaa gttgtcttgc cctttcatct cttgcaggaa ctgagaatga 1020 taacgaatcaagttgagatt gttcccacac aatcccttct ctctgccatt gccatgttga 1080 acttttccatatttattcaa attttccatt ttaatgtaag agcgagtgta cactagaagg 1140 ccgaaaagttgtatcaatga tgattgtatg tacactaatg acattgttta taatgatata 1200 agattcactttgttaatggt tttgcaaaat ttaagttgta ttaaaaacaa aatgtattta 1260 agaattaattaaacttgaga tcatttgatc aaagttcttg ttataaaaaa aaaaaaaaaa 1320 aaaa 1324<210> SEQ ID NO 100 <211> LENGTH: 344 <212> TYPE: PRT <213> ORGANISM:Glycine max <400> SEQUENCE: 100 His Glu Val Val Asp Ile Leu Lys Phe SerGly Gly Tyr Leu Gly Arg 1 5 10 15 Tyr Ala Ile Gly Arg Leu Val Ala MetAsn Tyr Gln Ala Arg Met Gly 20 25 30 Met Met Ala Ala Gly Ser Tyr Gly LeuPro Gln Phe Arg Met Arg Val 35 40 45 Phe Leu Trp Gly Ala Arg Pro Thr GluGln Leu Pro Pro Tyr Pro Leu 50 55 60 Pro Thr His Glu Val Val Ser Arg GlyPhe Val Pro Thr Glu Phe Glu 65 70 75 80 Glu Ile Thr Val Ala Tyr Asp LysLys Asp Thr Cys Gln Leu Ala Gly 85 90 95 Ala Leu Leu Leu Asp Asp Ala IleSer Asp Leu Pro Pro Val Thr Asn 100 105 110 Asp Glu Asn Gln Asp Glu ArgAsn Tyr Gly Ala Pro Ala Arg Thr Glu 115 120 125 Phe Gln Arg Tyr Ile ArgLeu Lys Lys Asn Glu Met Val Gly Ser Met 130 135 140 Ala Thr Ala Gln SerThr Pro Arg Arg Ile Leu Tyr Asp His Arg Pro 145 150 155 160 Leu Gln LeuAsn Lys Asp Asp Tyr Asp Arg Val Cys Gln Ile Pro Gln 165 170 175 Lys LysGly Ala Asn Phe Arg Asp Leu Pro Gly Val Leu Val Asn Gly 180 185 190 AsnLys Val Glu Trp Asp Pro Ser Val Glu Arg Val Met Leu Asp Ser 195 200 205Gly Lys Pro Leu Val Pro Asp Tyr Ala Met Thr Phe Val Arg Gly Thr 210 215220 Ser Thr Lys Pro Phe Gly Arg Leu Trp Trp Asp Glu Ile Val Pro Thr 225230 235 240 Val Val Thr Arg Ala Glu Pro His Asn Gln Ala Ile Leu His ProArg 245 250 255 Gln Asn Arg Val Leu Thr Ile Arg Glu Asn Ala Arg Leu GlnGly Phe 260 265 270 Pro Asp Cys Tyr Lys Leu Cys Gly Pro Val Lys Glu ArgTyr Ile Gln 275 280 285 Val Gly Asn Ala Val Ala Val Pro Val Ala Leu AlaLeu Gly Tyr Thr 290 295 300 Phe Gly Leu Ala Cys Gln Gly Leu Ser Asp AspLys Pro Leu Thr Thr 305 310 315 320 Leu Pro Phe Lys Tyr Pro Ser Cys LeuAla Leu Ser Ser Leu Ala Gly 325 330 335 Thr Glu Asn Asp Asn Glu Ser Ser340 <210> SEQ ID NO 101 <211> LENGTH: 527 <212> TYPE: DNA <213>ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION:(435) <221> NAME/KEY: unsure <222> LOCATION: (506) <400> SEQUENCE: 101cattgcatct cttctggaga tgggatacca ggtccggttt ggaattctag aagcaggggc 60ttttggtgtt gcccagtcca ggaaaagggc gtttatttgg gctgctgcac ctggagagat 120gcttcctgat tggccagagc cgatgcatgt gtttgctagc cctgagctga agataacact 180gcctgatggc caatactatg cagctgcaag aagcactgct ggtggagcgc ctttccgagc 240gattactgtt agagatacaa ttggggatct gcctaaagtg ggaaatggtg ccagcaaact 300cacgcttgag tatggaggtg agcccgtgtc ttggttccag aagaagataa gagggaatat 360gatggtactg aatgggcaca tatctaagga gatgaatgag ctgaacctaa taaggtgcca 420acacattccg aaacngccgg gttgtgattg gcatgaccta ccggacgaga aggttaagct 480gtcaaatggg cagatggctg acctgntacc ttggtgcctg cccaaca 527 <210> SEQ ID NO102 <211> LENGTH: 175 <212> TYPE: PRT <213> ORGANISM: Zea mays <220>FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (145) <221> NAME/KEY:UNSURE <222> LOCATION: (169) <400> SEQUENCE: 102 Ile Ala Ser Leu Leu GluMet Gly Tyr Gln Val Arg Phe Gly Ile Leu 1 5 10 15 Glu Ala Gly Ala PheGly Val Ala Gln Ser Arg Lys Arg Ala Phe Ile 20 25 30 Trp Ala Ala Ala ProGly Glu Met Leu Pro Asp Trp Pro Glu Pro Met 35 40 45 His Val Phe Ala SerPro Glu Leu Lys Ile Thr Leu Pro Asp Gly Gln 50 55 60 Tyr Tyr Ala Ala AlaArg Ser Thr Ala Gly Gly Ala Pro Phe Arg Ala 65 70 75 80 Ile Thr Val ArgAsp Thr Ile Gly Asp Leu Pro Lys Val Gly Asn Gly 85 90 95 Ala Ser Lys LeuThr Leu Glu Tyr Gly Gly Glu Pro Val Ser Trp Phe 100 105 110 Gln Lys LysIle Arg Gly Asn Met Met Val Leu Asn Gly His Ile Ser 115 120 125 Lys GluMet Asn Glu Leu Asn Leu Ile Arg Cys Gln His Ile Pro Lys 130 135 140 XaaPro Gly Cys Asp Trp His Asp Leu Pro Asp Glu Lys Val Lys Leu 145 150 155160 Ser Asn Gly Gln Met Ala Asp Leu Xaa Pro Trp Cys Leu Pro Asn 165 170175 <210> SEQ ID NO 103 <211> LENGTH: 791 <212> TYPE: DNA <213>ORGANISM: Oryza sativa <400> SEQUENCE: 103 gcacgaggct atgtcaaggggacaggctcc ttactagcta cgtccaataa cctcaaacga 60 atttccaaag aggaccttgaaatttcttca ctgaaggagt taggcttacg atttttcacc 120 cctcgtgagg ttgctaatctgcattctttt ccatcgagtt tccactttcc gaatcacata 180 agccttaggc aacagtatgctatgctgggc aatagcctga gtgtagcagt tgtggggcct 240 ttgctgcgtt atctgtttgctgagacatag tgatgttcaa tcagttggca catgatgcta 300 cacaggacaa acacattgaccttcctttgc ccgtgttatc accctaaagc caggttggtt 360 cgtgggtatg attcgtgtactccccgaagc tgatggttgt ggttcaaacc ttgcaaccca 420 attggagtga atttcccacagtggtgcaac ttcaacttca tagggcttgg ggacttgtcg 480 attcatcggt gggaagaagacagcgtgcat gatggcccgg catgtatcgg tgaaactctg 540 acgcgaatct tacctagtagagtagagaag cttcctcagc attttccctt tgccccaggc 600 ataagtttta ataataatgttgaatgtttg aggtgttaca gcatttgact catgcgcatt 660 gtgaaacgag gttattatggtttagaactg aacttcaact tcatgagctc accattcgac 720 tgtctcaaaa aaaaaaaaaaaaaaaaaaaa aacaaaaaaa aaataaaaat gggggcgccg 780 tagccagtcg a 791 <210>SEQ ID NO 104 <211> LENGTH: 89 <212> TYPE: PRT <213> ORGANISM: Oryzasativa <400> SEQUENCE: 104 Ala Arg Gly Tyr Val Lys Gly Thr Gly Ser LeuLeu Ala Thr Ser Asn 1 5 10 15 Asn Leu Lys Arg Ile Ser Lys Glu Asp LeuGlu Ile Ser Ser Leu Lys 20 25 30 Glu Leu Gly Leu Arg Phe Phe Thr Pro ArgGlu Val Ala Asn Leu His 35 40 45 Ser Phe Pro Ser Ser Phe His Phe Pro AsnHis Ile Ser Leu Arg Gln 50 55 60 Gln Tyr Ala Met Leu Gly Asn Ser Leu SerVal Ala Val Val Gly Pro 65 70 75 80 Leu Leu Arg Tyr Leu Phe Ala Glu Thr85 <210> SEQ ID NO 105 <211> LENGTH: 533 <212> TYPE: DNA <213> ORGANISM:Glycine max <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (402)<221> NAME/KEY: unsure <222> LOCATION: (423) <221> NAME/KEY: unsure<222> LOCATION: (433) <221> NAME/KEY: unsure <222> LOCATION: (447) <221>NAME/KEY: unsure <222> LOCATION: (508) <221> NAME/KEY: unsure <222>LOCATION: (514) <221> NAME/KEY: unsure <222> LOCATION: (530) <400>SEQUENCE: 105 aaagctaatc atcctgaggc attggtgttc attaacaatt gcaatgttattcttagggct 60 gtaatggaga agtgtgggga cacagatgat tgtatctcaa catctgaggctgcagaattg 120 gctgcaaagc ttgatgagaa ggaaataagt agtttaccaa tgcctggacaagttgatttc 180 attaatggtg gtcctccatg ccagggtttc tctgggatga ataggtttaaccagagcagt 240 tggagtaaag tccagtgtga gatgatattg gcattcttat cctttgctgattatttccgg 300 ccaaggtatt tcttgttgga gaatgtgagg aactttgtgt ctttcaataaaggacagaca 360 ttccgtttaa ctttggcttc acttcttgag atgggttatc angtgaggtttggtatcctt 420 gangctggag canttggggt ttctcantca agaaaaaggg cattcatatgggctgcttct 480 cctgaagatg tgcttcccga atgggcanaa ccantgcaag tcctttcggnccc 533 <210> SEQ ID NO 106 <211> LENGTH: 176 <212> TYPE: PRT <213>ORGANISM: Glycine max <220> FEATURE: <221> NAME/KEY: UNSURE <222>LOCATION: (134) <221> NAME/KEY: UNSURE <222> LOCATION: (141) <221>NAME/KEY: UNSURE <222> LOCATION: (145) <221> NAME/KEY: UNSURE <222>LOCATION: (149) <221> NAME/KEY: UNSURE <222> LOCATION: (170) <221>NAME/KEY: UNSURE <222> LOCATION: (172) <400> SEQUENCE: 106 Lys Ala AsnHis Pro Glu Ala Leu Val Phe Ile Asn Asn Cys Asn Val 1 5 10 15 Ile LeuArg Ala Val Met Glu Lys Cys Gly Asp Thr Asp Asp Cys Ile 20 25 30 Ser ThrSer Glu Ala Ala Glu Leu Ala Ala Lys Leu Asp Glu Lys Glu 35 40 45 Ile SerSer Leu Pro Met Pro Gly Gln Val Asp Phe Ile Asn Gly Gly 50 55 60 Pro ProCys Gln Gly Phe Ser Gly Met Asn Arg Phe Asn Gln Ser Ser 65 70 75 80 TrpSer Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu Ser Phe Ala 85 90 95 AspTyr Phe Arg Pro Arg Tyr Phe Leu Leu Glu Asn Val Arg Asn Phe 100 105 110Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Thr Leu Ala Ser Leu 115 120125 Leu Glu Met Gly Tyr Xaa Val Arg Phe Gly Ile Leu Xaa Ala Gly Ala 130135 140 Xaa Gly Val Ser Xaa Ser Arg Lys Arg Ala Phe Ile Trp Ala Ala Ser145 150 155 160 Pro Glu Asp Val Leu Pro Glu Trp Ala Xaa Pro Xaa Gln ValLeu Ser 165 170 175 <210> SEQ ID NO 107 <211> LENGTH: 1343 <212> TYPE:DNA <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 107 ttcggcacgaggagtaaagt tcagtgtgag atgattttag cattcctctc ttttgcggag 60 tattttcgtcccagattctt tcttttagaa aatgtcagga attttgtttc cttcaacaaa 120 ggacagaccttccgactggc agttgcatct cttctggaaa tgggatacca ggttcgtttt 180 ggaatcttagaagcaggtac ttttggtgtt gctcagtcca ggaaaagggc attcatttgg 240 gctgctgctcctggagagat tcttcctgat tggccagaac cgatgcaagt gtttgctagc 300 cctgaactgaaaataacact gcctgatggc aaatactatg cagctgccaa aagcactgct 360 ggtggggcgcctttccgtgc aataactgtt agagatacaa ttggggattt gccgaaagtg 420 gaaaatggtgcaagtaaact catacttgag tatggaggtg agcctacctc ttggtttcag 480 aagaagatcagaggtagcac tattgcattg aacgatcaca tatctaagga gatgaatgaa 540 ttaaatctcataagatgcaa acacattccc aaacgacctg gttgtgactg gcatgacctg 600 ccagatgagaaggtgaagct atcttctggg caaatggtgg acctgatacc ttggtgcttg 660 cctaacaccgctaaaaggca caatcagtgg aagggtctgt atgggaggtt agattgggag 720 ggcaatttccccacttctgt gactgatcct cagccgatgg gcaaggttgg catgtgcttc 780 catcctgaccaggacaggat tatcacggtc cgtgaatgtg cgcgatctca gggctttcct 840 gacagctaccagttttcggg caccattcag agcaagcaca ggcagattgg caacgctgtg 900 ccaccccctcttgcctttgc gcttgggagg aagctgaagg aagccgtcga tgcgaagcgt 960 cagcaggcctgacagcgtcc gctagaactc ttgagggaca gaagcaccag cattcaagtt 1020 tctcctgtattgtgcatggc tggcatgaac tgtcccgaac cagcattcaa gccacatact 1080 gatgtgacgatgtgaactat catgcatgaa ctttcttagt cgattgtgat tttattggtc 1140 tccttgtatatccttagatt attgttgatt ggcgcccaaa ttcagagttg tcattgtatc 1200 caaacattgcacgagcgtac gtctactagt ggctgttgca tgtggtgtga catgttctgg 1260 agctaagatataaaacgttt cctcgtactc gatatttggg gatagaagtc ttttgtgtgt 1320 atgaaaaaaaaaaaaaaaaa aaa 1343 <210> SEQ ID NO 108 <211> LENGTH: 323 <212> TYPE:PRT <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 108 Phe Gly ThrArg Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu 1 5 10 15 Ser PheAla Glu Tyr Phe Arg Pro Arg Phe Phe Leu Leu Glu Asn Val 20 25 30 Arg AsnPhe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Ala Val 35 40 45 Ala SerLeu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu Glu 50 55 60 Ala GlyThr Phe Gly Val Ala Gln Ser Arg Lys Arg Ala Phe Ile Trp 65 70 75 80 AlaAla Ala Pro Gly Glu Ile Leu Pro Asp Trp Pro Glu Pro Met Gln 85 90 95 ValPhe Ala Ser Pro Glu Leu Lys Ile Thr Leu Pro Asp Gly Lys Tyr 100 105 110Tyr Ala Ala Ala Lys Ser Thr Ala Gly Gly Ala Pro Phe Arg Ala Ile 115 120125 Thr Val Arg Asp Thr Ile Gly Asp Leu Pro Lys Val Glu Asn Gly Ala 130135 140 Ser Lys Leu Ile Leu Glu Tyr Gly Gly Glu Pro Thr Ser Trp Phe Gln145 150 155 160 Lys Lys Ile Arg Gly Ser Thr Ile Ala Leu Asn Asp His IleSer Lys 165 170 175 Glu Met Asn Glu Leu Asn Leu Ile Arg Cys Lys His IlePro Lys Arg 180 185 190 Pro Gly Cys Asp Trp His Asp Leu Pro Asp Glu LysVal Lys Leu Ser 195 200 205 Ser Gly Gln Met Val Asp Leu Ile Pro Trp CysLeu Pro Asn Thr Ala 210 215 220 Lys Arg His Asn Gln Trp Lys Gly Leu TyrGly Arg Leu Asp Trp Glu 225 230 235 240 Gly Asn Phe Pro Thr Ser Val ThrAsp Pro Gln Pro Met Gly Lys Val 245 250 255 Gly Met Cys Phe His Pro AspGln Asp Arg Ile Ile Thr Val Arg Glu 260 265 270 Cys Ala Arg Ser Gln GlyPhe Pro Asp Ser Tyr Gln Phe Ser Gly Thr 275 280 285 Ile Gln Ser Lys HisArg Gln Ile Gly Asn Ala Val Pro Pro Pro Leu 290 295 300 Ala Phe Ala LeuGly Arg Lys Leu Lys Glu Ala Val Asp Ala Lys Arg 305 310 315 320 Gln GlnAla <210> SEQ ID NO 109 <211> LENGTH: 500 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (438) <221> NAME/KEY: unsure <222> LOCATION: (500) <400>SEQUENCE: 109 cagaagccta cgagaaccac agtccagccg accagccgcc gccgcgcgataaccggagtt 60 aatggagacg ccgcccccat ggagggtcct cgagttctac agcggtatcggcggcatgcg 120 gtactccctt gcgtcgtcgg gcgttcgagc ggaggtggtg gaggcctttgacatcaacga 180 cgtcgccaac gacgtctacg agcacaactt cggccaccgc ccctgccagggaaacattca 240 aacactcact gctggtgatc tagacaagta caaggcacat gcatggctcctttctcctcc 300 atgtcaacca tatacacgga cttcagaaac attcagctga tgctcgggcattttcattta 360 taaagatttt aaaccttatg caaaacatga gctttcctcc acaaatgttatttgtggaaa 420 atgttgtccg gattcgangt ttttttaagc ttaaaattat tgaatccacagttgactgta 480 gtattggact ctacaatatn 500 <210> SEQ ID NO 110 <211>LENGTH: 79 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400>SEQUENCE: 110 Arg Val Leu Glu Phe Tyr Ser Gly Ile Gly Gly Met Arg TyrSer Leu 1 5 10 15 Ala Ser Ser Gly Val Arg Ala Glu Val Val Glu Ala PheAsp Ile Asn 20 25 30 Asp Val Ala Asn Asp Val Tyr Glu His Asn Phe Gly HisArg Pro Cys 35 40 45 Gln Gly Asn Ile Gln Thr Leu Thr Ala Gly Asp Leu AspLys Tyr Lys 50 55 60 Ala His Ala Trp Leu Leu Ser Pro Pro Cys Gln Pro TyrThr Arg 65 70 75 <210> SEQ ID NO 111 <211> LENGTH: 791 <212> TYPE: DNA<213> ORGANISM: Oryza sativa <400> SEQUENCE: 111 gcacgaggct atgtcaaggggacaggctcc ttactagcta cgtccaataa cctcaaacga 60 atttccaaag aggaccttgaaatttcttca ctgaaggagt taggcttacg atttttcacc 120 cctcgtgagg ttgctaatctgcattctttt ccatcgagtt tccactttcc gaatcacata 180 agccttaggc aacagtatgctatgctgggc aatagcctga gtgtagcagt tgtggggcct 240 ttgctgcgtt atctgtttgctgagacatag tgatgttcaa tcagttggca catgatgcta 300 cacaggacaa acacattgaccttcctttgc ccgtgttatc accctaaagc caggttggtt 360 cgtgggtatg attcgtgtactccccgaagc tgatggttgt ggttcaaacc ttgcaaccca 420 attggagtga atttcccacagtggtgcaac ttcaacttca tagggcttgg ggacttgtcg 480 attcatcggt gggaagaagacagcgtgcat gatggcccgg catgtatcgg tgaaactctg 540 acgcgaatct tacctagtagagtagagaag cttcctcagc attttccctt tgccccaggc 600 ataagtttta ataataatgttgaatgtttg aggtgttaca gcatttgact catgcgcatt 660 gtgaaacgag gttattatggtttagaactg aacttcaact tcatgagctc accattcgac 720 tgtctcaaaa aaaaaaaaaaaaaaaaaaaa aacaaaaaaa aaataaaaat gggggcgccg 780 tagccagtcg a 791 <210>SEQ ID NO 112 <211> LENGTH: 89 <212> TYPE: PRT <213> ORGANISM: Oryzasativa <400> SEQUENCE: 112 Ala Arg Gly Tyr Val Lys Gly Thr Gly Ser LeuLeu Ala Thr Ser Asn 1 5 10 15 Asn Leu Lys Arg Ile Ser Lys Glu Asp LeuGlu Ile Ser Ser Leu Lys 20 25 30 Glu Leu Gly Leu Arg Phe Phe Thr Pro ArgGlu Val Ala Asn Leu His 35 40 45 Ser Phe Pro Ser Ser Phe His Phe Pro AsnHis Ile Ser Leu Arg Gln 50 55 60 Gln Tyr Ala Met Leu Gly Asn Ser Leu SerVal Ala Val Val Gly Pro 65 70 75 80 Leu Leu Arg Tyr Leu Phe Ala Glu Thr85 <210> SEQ ID NO 113 <211> LENGTH: 1492 <212> TYPE: DNA <213>ORGANISM: Glycine max <400> SEQUENCE: 113 tttaaagcta atcatcctgaggcattggtg ttcattaaca attgcaatgt tattcttagg 60 gctgtaatgg agaagtgtggggacacagat gattgtatct caacatctga ggctgcagaa 120 ttggctgcaa agcttgatgagaaggaaata agtagtttac caatgcctgg acaagttgat 180 ttcattaatg gtggtcctccatgccagggt ttctctggga tgaataggtt taaccagagc 240 agttggagta aagtccagtgtgagatgata ttggcattct tatcctttgc tgattatttc 300 cggccaaggt atttcttgttggagaatgtg aggaactttg tgtctttcaa taaaggacag 360 acattccgtt taactttggcttcacttctt gagatgggtt atcaggtgag gtttggtatc 420 cttgaggctg gagcatttggggtttctcag tcaagaaaaa gggcattcat atgggctgct 480 tctcctgagg atgtgcttcctgaatggcca gaaccagtgc acgtcttttc ggcccctgag 540 ttgaagatca cattatcagaaaatgtccag tatgctgctg tccgcagtac tgcaaatggt 600 gctcctttac gtgcaataactgttcgagat actattggtg atctcccagc tgttggcaat 660 ggagcctcaa aaggaaacatggagtatcaa aatgatccag tctcatggtt tcaaaagaag 720 attcgaggtg atatggttgtcttgactgat catatatcaa aggagatgaa cgaattgaac 780 ttgattcgat gccagaaaattcccaagaga ccaggtgctg attggcgtga ccttccagaa 840 gaaaagataa aactgtctagtggacaagtt gttgatttga taccatggtg cttgccaaac 900 acggctaagc ggcacaatcaatggaaggga ctgtttggca ggttggattg gcaaggaaat 960 ttcccaactt ccgttacagaccctcagcca atggggaagg ttggaatgtg cttccaccct 1020 gaccaagata ggattcttactgttcgtgaa tgtgctcggt ctcaaggctt cccagatagc 1080 tatgaatttg ctggcaatatcatacacaag caccggcaga ttggtaatgc tgtgcctcct 1140 cctctagcat cagcattggggagaaagctc aaggaagcag tggacagtaa gagctccact 1200 tagaagatgg gccttctacattaggtcatg cttattgtat tcataacagt caccaagata 1260 ttgcaaatca tcattccgggttccaaaaac tagaaacccc ttgtatatag tgatatccat 1320 tggccctttt ttgggggtaattcccttttt taactttccc cacccaagga attgaatggg 1380 tggtgcccct tttttttcaactggggtttt tttggtttaa aaaaaaaaaa aaaaaaaaaa 1440 aaaaaaaaaa aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1492 <210> SEQ ID NO 114 <211>LENGTH: 400 <212> TYPE: PRT <213> ORGANISM: Glycine max <400> SEQUENCE:114 Phe Lys Ala Asn His Pro Glu Ala Leu Val Phe Ile Asn Asn Cys Asn 1 510 15 Val Ile Leu Arg Ala Val Met Glu Lys Cys Gly Asp Thr Asp Asp Cys 2025 30 Ile Ser Thr Ser Glu Ala Ala Glu Leu Ala Ala Lys Leu Asp Glu Lys 3540 45 Glu Ile Ser Ser Leu Pro Met Pro Gly Gln Val Asp Phe Ile Asn Gly 5055 60 Gly Pro Pro Cys Gln Gly Phe Ser Gly Met Asn Arg Phe Asn Gln Ser 6570 75 80 Ser Trp Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu Ser Phe85 90 95 Ala Asp Tyr Phe Arg Pro Arg Tyr Phe Leu Leu Glu Asn Val Arg Asn100 105 110 Phe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Thr Leu AlaSer 115 120 125 Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu GluAla Gly 130 135 140 Ala Phe Gly Val Ser Gln Ser Arg Lys Arg Ala Phe IleTrp Ala Ala 145 150 155 160 Ser Pro Glu Asp Val Leu Pro Glu Trp Pro GluPro Val His Val Phe 165 170 175 Ser Ala Pro Glu Leu Lys Ile Thr Leu SerGlu Asn Val Gln Tyr Ala 180 185 190 Ala Val Arg Ser Thr Ala Asn Gly AlaPro Leu Arg Ala Ile Thr Val 195 200 205 Arg Asp Thr Ile Gly Asp Leu ProAla Val Gly Asn Gly Ala Ser Lys 210 215 220 Gly Asn Met Glu Tyr Gln AsnAsp Pro Val Ser Trp Phe Gln Lys Lys 225 230 235 240 Ile Arg Gly Asp MetVal Val Leu Thr Asp His Ile Ser Lys Glu Met 245 250 255 Asn Glu Leu AsnLeu Ile Arg Cys Gln Lys Ile Pro Lys Arg Pro Gly 260 265 270 Ala Asp TrpArg Asp Leu Pro Glu Glu Lys Ile Lys Leu Ser Ser Gly 275 280 285 Gln ValVal Asp Leu Ile Pro Trp Cys Leu Pro Asn Thr Ala Lys Arg 290 295 300 HisAsn Gln Trp Lys Gly Leu Phe Gly Arg Leu Asp Trp Gln Gly Asn 305 310 315320 Phe Pro Thr Ser Val Thr Asp Pro Gln Pro Met Gly Lys Val Gly Met 325330 335 Cys Phe His Pro Asp Gln Asp Arg Ile Leu Thr Val Arg Glu Cys Ala340 345 350 Arg Ser Gln Gly Phe Pro Asp Ser Tyr Glu Phe Ala Gly Asn IleIle 355 360 365 His Lys His Arg Gln Ile Gly Asn Ala Val Pro Pro Pro LeuAla Ser 370 375 380 Ala Leu Gly Arg Lys Leu Lys Glu Ala Val Asp Ser LysSer Ser Thr 385 390 395 400 <210> SEQ ID NO 115 <211> LENGTH: 1343 <212>TYPE: DNA <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 115ttcggcacga ggagtaaagt tcagtgtgag atgattttag cattcctctc ttttgcggag 60tattttcgtc ccagattctt tcttttagaa aatgtcagga attttgtttc cttcaacaaa 120ggacagacct tccgactggc agttgcatct cttctggaaa tgggatacca ggttcgtttt 180ggaatcttag aagcaggtac ttttggtgtt gctcagtcca ggaaaagggc attcatttgg 240gctgctgctc ctggagagat tcttcctgat tggccagaac cgatgcaagt gtttgctagc 300cctgaactga aaataacact gcctgatggc aaatactatg cagctgccaa aagcactgct 360ggtggggcgc ctttccgtgc aataactgtt agagatacaa ttggggattt gccgaaagtg 420gaaaatggtg caagtaaact catacttgag tatggaggtg agcctacctc ttggtttcag 480aagaagatca gaggtagcac tattgcattg aacgatcaca tatctaagga gatgaatgaa 540ttaaatctca taagatgcaa acacattccc aaacgacctg gttgtgactg gcatgacctg 600ccagatgaga aggtgaagct atcttctggg caaatggtgg acctgatacc ttggtgcttg 660cctaacaccg ctaaaaggca caatcagtgg aagggtctgt atgggaggtt agattgggag 720ggcaatttcc ccacttctgt gactgatcct cagccgatgg gcaaggttgg catgtgcttc 780catcctgacc aggacaggat tatcacggtc cgtgaatgtg cgcgatctca gggctttcct 840gacagctacc agttttcggg caccattcag agcaagcaca ggcagattgg caacgctgtg 900ccaccccctc ttgcctttgc gcttgggagg aagctgaagg aagccgtcga tgcgaagcgt 960cagcaggcct gacagcgtcc gctagaactc ttgagggaca gaagcaccag cattcaagtt 1020tctcctgtat tgtgcatggc tggcatgaac tgtcccgaac cagcattcaa gccacatact 1080gatgtgacga tgtgaactat catgcatgaa ctttcttagt cgattgtgat tttattggtc 1140tccttgtata tccttagatt attgttgatt ggcgcccaaa ttcagagttg tcattgtatc 1200caaacattgc acgagcgtac gtctactagt ggctgttgca tgtggtgtga catgttctgg 1260agctaagata taaaacgttt cctcgtactc gatatttggg gatagaagtc ttttgtgtgt 1320atgaaaaaaa aaaaaaaaaa aaa 1343 <210> SEQ ID NO 116 <211> LENGTH: 323<212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 116Phe Gly Thr Arg Ser Lys Val Gln Cys Glu Met Ile Leu Ala Phe Leu 1 5 1015 Ser Phe Ala Glu Tyr Phe Arg Pro Arg Phe Phe Leu Leu Glu Asn Val 20 2530 Arg Asn Phe Val Ser Phe Asn Lys Gly Gln Thr Phe Arg Leu Ala Val 35 4045 Ala Ser Leu Leu Glu Met Gly Tyr Gln Val Arg Phe Gly Ile Leu Glu 50 5560 Ala Gly Thr Phe Gly Val Ala Gln Ser Arg Lys Arg Ala Phe Ile Trp 65 7075 80 Ala Ala Ala Pro Gly Glu Ile Leu Pro Asp Trp Pro Glu Pro Met Gln 8590 95 Val Phe Ala Ser Pro Glu Leu Lys Ile Thr Leu Pro Asp Gly Lys Tyr100 105 110 Tyr Ala Ala Ala Lys Ser Thr Ala Gly Gly Ala Pro Phe Arg AlaIle 115 120 125 Thr Val Arg Asp Thr Ile Gly Asp Leu Pro Lys Val Glu AsnGly Ala 130 135 140 Ser Lys Leu Ile Leu Glu Tyr Gly Gly Glu Pro Thr SerTrp Phe Gln 145 150 155 160 Lys Lys Ile Arg Gly Ser Thr Ile Ala Leu AsnAsp His Ile Ser Lys 165 170 175 Glu Met Asn Glu Leu Asn Leu Ile Arg CysLys His Ile Pro Lys Arg 180 185 190 Pro Gly Cys Asp Trp His Asp Leu ProAsp Glu Lys Val Lys Leu Ser 195 200 205 Ser Gly Gln Met Val Asp Leu IlePro Trp Cys Leu Pro Asn Thr Ala 210 215 220 Lys Arg His Asn Gln Trp LysGly Leu Tyr Gly Arg Leu Asp Trp Glu 225 230 235 240 Gly Asn Phe Pro ThrSer Val Thr Asp Pro Gln Pro Met Gly Lys Val 245 250 255 Gly Met Cys PheHis Pro Asp Gln Asp Arg Ile Ile Thr Val Arg Glu 260 265 270 Cys Ala ArgSer Gln Gly Phe Pro Asp Ser Tyr Gln Phe Ser Gly Thr 275 280 285 Ile GlnSer Lys His Arg Gln Ile Gly Asn Ala Val Pro Pro Pro Leu 290 295 300 AlaPhe Ala Leu Gly Arg Lys Leu Lys Glu Ala Val Asp Ala Lys Arg 305 310 315320 Gln Gln Ala <210> SEQ ID NO 117 <211> LENGTH: 1285 <212> TYPE: DNA<213> ORGANISM: Triticum aestivum <400> SEQUENCE: 117 gcacgagcagaagcctacga gaaccacagt ccagccgacc agccgccgcc gcgcgataac 60 cggagttaatggagacgccg cccccatgga gggtcctcga gttctacagc ggtatcggcg 120 gcatgcggtactcccttgcg tcgtcgggcg ttcgagcgga ggtggtggag gcctttgaca 180 tcaacgacgtcgccaacgac gtctacgagc acaacttcgg ccaccgcccc tgccagggaa 240 acattcaaacactcactgct ggtgatctag acaagtacaa ggcacatgca tggctccttt 300 ctcctccatgtcaaccatat acacgacttc agaaacattc agctgatgct cgggcatttt 360 catttataaagattttaaac cttatgcaaa acatgagctt tcctccacaa atgttatttg 420 tggaaaatgttgtcggattc gaggtatctg atacacatga ccagttgcta gcggtccttt 480 caactctcagtttcaacaca caagaattca tcctaagccc cttgcagttt ggtgtcccat 540 attctagaccgcgctacttc tgtttggcaa aacaggaatc tatgtgtttt ccaaatccat 600 cagtcaatgacaagctgctt aggacaccta catgcctaac attgaacact acgagaactc 660 agaatagctatgaccagaat gaagatgatc tggaagtagt atgtaatcca ataagaaact 720 tccttgaagcacagagtatt ggagataagg aatcttcagc catcatctct gactttaagg 780 aggctgatggatgcactccg attgaaactg cttcacatga ctacacagtt ccactaagct 840 tgattgaacggtggggaaat gctatggata ttgtataccc tgaatcaaaa cggtgctgct 900 gttttactaaaagttattat cgctatgtga agggcacagg ctctgtactg gttacatcta 960 aaagcctcaaaccagttcca aaagagaacc ttgaaatgtc ttcactgagt gagttgggtc 1020 tacggtttttcacccctcga gaggtcgcta atttgcattc atttcccccg agtttccgtt 1080 ttccggatcagataagcctc agacaacagt atgccatgct gggtaatagt ctgagcatag 1140 cggttgtggctcctttgttg tgctatctgt ttgccgagac atagacaagt tctcaatcat 1200 ttggcaggtgatattcacct tggcccgcca tctgatattt gtcaaaatct tgccacactt 1260 gagttgcaagtatgatattt tatcc 1285 <210> SEQ ID NO 118 <211> LENGTH: 371 <212> TYPE:PRT <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 118 Met Glu ThrPro Pro Pro Trp Arg Val Leu Glu Phe Tyr Ser Gly Ile 1 5 10 15 Gly GlyMet Arg Tyr Ser Leu Ala Ser Ser Gly Val Arg Ala Glu Val 20 25 30 Val GluAla Phe Asp Ile Asn Asp Val Ala Asn Asp Val Tyr Glu His 35 40 45 Asn PheGly His Arg Pro Cys Gln Gly Asn Ile Gln Thr Leu Thr Ala 50 55 60 Gly AspLeu Asp Lys Tyr Lys Ala His Ala Trp Leu Leu Ser Pro Pro 65 70 75 80 CysGln Pro Tyr Thr Arg Leu Gln Lys His Ser Ala Asp Ala Arg Ala 85 90 95 PheSer Phe Ile Lys Ile Leu Asn Leu Met Gln Asn Met Ser Phe Pro 100 105 110Pro Gln Met Leu Phe Val Glu Asn Val Val Gly Phe Glu Val Ser Asp 115 120125 Thr His Asp Gln Leu Leu Ala Val Leu Ser Thr Leu Ser Phe Asn Thr 130135 140 Gln Glu Phe Ile Leu Ser Pro Leu Gln Phe Gly Val Pro Tyr Ser Arg145 150 155 160 Pro Arg Tyr Phe Cys Leu Ala Lys Gln Glu Ser Met Cys PhePro Asn 165 170 175 Pro Ser Val Asn Asp Lys Leu Leu Arg Thr Pro Thr CysLeu Thr Leu 180 185 190 Asn Thr Thr Arg Thr Gln Asn Ser Tyr Asp Gln AsnGlu Asp Asp Leu 195 200 205 Glu Val Val Cys Asn Pro Ile Arg Asn Phe LeuGlu Ala Gln Ser Ile 210 215 220 Gly Asp Lys Glu Ser Ser Ala Ile Ile SerAsp Phe Lys Glu Ala Asp 225 230 235 240 Gly Cys Thr Pro Ile Glu Thr AlaSer His Asp Tyr Thr Val Pro Leu 245 250 255 Ser Leu Ile Glu Arg Trp GlyAsn Ala Met Asp Ile Val Tyr Pro Glu 260 265 270 Ser Lys Arg Cys Cys CysPhe Thr Lys Ser Tyr Tyr Arg Tyr Val Lys 275 280 285 Gly Thr Gly Ser ValLeu Val Thr Ser Lys Ser Leu Lys Pro Val Pro 290 295 300 Lys Glu Asn LeuGlu Met Ser Ser Leu Ser Glu Leu Gly Leu Arg Phe 305 310 315 320 Phe ThrPro Arg Glu Val Ala Asn Leu His Ser Phe Pro Pro Ser Phe 325 330 335 ArgPhe Pro Asp Gln Ile Ser Leu Arg Gln Gln Tyr Ala Met Leu Gly 340 345 350Asn Ser Leu Ser Ile Ala Val Val Ala Pro Leu Leu Cys Tyr Leu Phe 355 360365 Ala Glu Thr 370 <210> SEQ ID NO 119 <211> LENGTH: 469 <212> TYPE:DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 119 attcggatctatctcacacc ctttctctct tttcttcttt tctttcgttt tcccttcgca 60 agtgagagatggcgcaaatt ctgcttcatg ggactctcca cgccaccgtc ttcgaggttg 120 ataggctcaatgctggtggt ggtggcggca attttttcag caagctcaag caaaactttg 180 aggagactgttggcatcgga aagggagtta ctaaactcta tgcaaccatt gatctggaga 240 aagcaagagtaggaaggact agaatcatag aaaatgaaca tactaatccc agatggtatg 300 agtcttttcacatttattgt gctcatatgg cttcaaatat catattcact gtgaaagatg 360 ataatcctattggggcaact ttaattggaa gagcttatgt gcctgtttcg gaggtcttgg 420 atggtgaggaaatagatagg tgggttgaaa tcttggacga ggaaaaaaa 469 <210> SEQ ID NO 120 <211>LENGTH: 133 <212> TYPE: PRT <213> ORGANISM: Glycine max <400> SEQUENCE:120 Met Ala Gln Ile Leu Leu His Gly Thr Leu His Ala Thr Val Phe Glu 1 510 15 Val Asp Arg Leu Asn Ala Gly Gly Gly Gly Gly Asn Phe Phe Ser Lys 2025 30 Leu Lys Gln Asn Phe Glu Glu Thr Val Gly Ile Gly Lys Gly Val Thr 3540 45 Lys Leu Tyr Ala Thr Ile Asp Leu Glu Lys Ala Arg Val Gly Arg Thr 5055 60 Arg Ile Ile Glu Asn Glu His Thr Asn Pro Arg Trp Tyr Glu Ser Phe 6570 75 80 His Ile Tyr Cys Ala His Met Ala Ser Asn Ile Ile Phe Thr Val Lys85 90 95 Asp Asp Asn Pro Ile Gly Ala Thr Leu Ile Gly Arg Ala Tyr Val Pro100 105 110 Val Ser Glu Val Leu Asp Gly Glu Glu Ile Asp Arg Trp Val GluIle 115 120 125 Leu Asp Glu Glu Lys 130 <210> SEQ ID NO 121 <211>LENGTH: 475 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum <220>FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (349) <221> NAME/KEY:unsure <222> LOCATION: (426) <221> NAME/KEY: unsure <222> LOCATION:(429) <221> NAME/KEY: unsure <222> LOCATION: (461) <400> SEQUENCE: 121atctgcctcc ctcccgattc atcgtctccc gtcccctctt tgtcgtcctc ttcaccagcg 60gcaggcggtg gtggcaggtg actgtgacta gcaggagcgc ggaggagggg tccttcggtc 120gccatggctc agatcttgct ccatgggaac ctccacgtca ccatcttcga ggcctcctcg 180ctctcccacc ccggccgcgc cagcggcggc gcccccaatt catccgcaat ttgtagaggg 240atttgaggaa actgtcggtg ttggcaaagg aagtccaagc tatatgccac cattgatctc 300cgagaaactc gtgttgggcg tacaagatgt tggggcaacg acccgtcanc ctccgctggt 360acgattcgtt cacatctact gtgcgcactt gccgccgatg ttatctcaag ctgaaaggcg 420acaccnatnt gggcgacctc atgggaaggc gtactccttc ngaaactgga aggga 475 <210>SEQ ID NO 122 <211> LENGTH: 94 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (41)<221> NAME/KEY: UNSURE <222> LOCATION: (76) <400> SEQUENCE: 122 Met AlaGln Ile Leu Leu His Gly Asn Leu His Val Thr Ile Phe Glu 1 5 10 15 AlaSer Ser Leu Ser His Pro Gly Arg Ala Ser Gly Gly Ala Pro Asn 20 25 30 SerSer Ala Ile Cys Arg Gly Ile Xaa Gly Asn Cys Arg Cys Trp Gln 35 40 45 ArgLys Ser Lys Leu Tyr Ala Thr Ile Asp Leu Arg Glu Thr Arg Val 50 55 60 GlyArg Thr Arg Cys Trp Gly Asn Asp Pro Ser Xaa Ser Ala Gly Thr 65 70 75 80Ile Arg Ser His Leu Leu Cys Ala Leu Ala Ala Asp Val Ile 85 90 <210> SEQID NO 123 <211> LENGTH: 2736 <212> TYPE: DNA <213> ORGANISM: Glycine max<400> SEQUENCE: 123 gcacgagcac tcactcactc acctccattt ccattgtcactgctccttat tcggatctat 60 ctcacaccct ttctctcttt tcttcttttc tttcgttttcccttcgcaag tgagagatgg 120 cgcaaattct gcttcatggg actctccacg ccaccgtcttcgaggttgat aggctcaatg 180 ctggtggtgg tggcggcaat tttttcagca agctcaagcaaaactttgag gagactgttg 240 gcatcggaaa gggagttact aaactctatg caaccattgatctggagaaa gcaagagtag 300 gaaggactag aatcatagaa aatgaacata ctaatcccagatggtatgag tcttttcaca 360 tttattgtgc tcatatggct tcaaatatca tattcactgtgaaagatgat aatcctattg 420 gggcaacttt aattggaaga gcttatgtgc ctgtttcggaggtcttggat ggtgaggaaa 480 tagataggtg ggttgaaatc ttggacgagg aaaaaaacccaatacaagag ggttcaaaga 540 tccatgtgaa gctgcaatat tttgatgtca caaaagaccgcaacagggct cgaggcatta 600 gaagtcccaa attccccggc gttccctata ctttcttctcacagaggcaa ggatgtaagg 660 tatctctgta ccaagatgct catgtacctg ataattttgtacctaaaata cctcttgctg 720 gaggcaagaa ttatgaggct cataggtgtt gggaggatatatttgatgca atcactaatg 780 ctagacactt catatacatt actggttggt ctgtttatactgaaatttcc ttggtgaggg 840 attctaggag gccaaagcct gggggagacc aaacacttggtgagcttctc aagaaaaagg 900 caaatgaagg ggttaaggta ttgatgcttg tttgggatgatagaacatca gttggtttgt 960 tgaaaaaaga tggactaatg gctactcacg atgaagaaactgcacagttc tttgaaggca 1020 ctgaggtgca ttgtgtttta tgcccccgca atcctgatgatggtggtagc attgttcagg 1080 atttacaaat ttctaccatg tttactcatc accagaagatcgtggtggtt gacggtgcga 1140 tgccaggtga agggtcagat aggcgaagaa ttgtgagttttgttgggggt attgacctct 1200 gcgacggaag atatgacact gccttccact cacttttcagaaccctagac acagcacacc 1260 atgacgattt tcatcagcct aactttcctg gtgctgctatcacaaaaggt ggtcccaggg 1320 aaccatggca cgacatccac tcccggcttg aaggacccatagcttgggat gttttgttca 1380 actttgagca gagatggaga aagcaaggtg ggaaggatgtacttgttcca ctgagagagc 1440 ttgaagatgt cattattccc ccatccccag taacgtttcctgaagatcat gagacctgga 1500 atgttcagtt gtttagatcc attgatggtg gggctgcttttgggttcccg gagactcctg 1560 aagatgctgc cagagctggt cttattagtg ggaaggataatatcattgat cgtagcattc 1620 aagatgctta tattaatgct attcgacgtg caaagaacttcatctatatt gaaaatcagt 1680 atttccttgg aagctctttt gcctggagtg ctgatgatattaagcctgaa gacattggtg 1740 ctttgcatct aatcccaaag gaactttcac tcaagattgttagtaagatt gaagctgggg 1800 aaaggtttgc tgtgtatgtt gtagtcccaa tgtggccagagggtgttcca gaaagtgcat 1860 cagttcaggc aatattggac tggcagaaga gaacaatggagatgatgtac aaggacatta 1920 ttcaggcact cagagctaag ggaatcgatg aagatcctcgaaactatttg acattcttct 1980 gccttggcaa ccgggaagtg aagaaaccag gagaatatgagccttctgag caaccagatc 2040 ctgattcaga ttatcagaga gcccaagagg cccgacgattcatgatttat gttcatacca 2100 agatgatgat agttgatgac gaatacataa tcgttggatctgccaatatc aaccaaaggt 2160 caatggatgg tgctagggac tctgagattg ccatgggtgcttatcagccc tatcatttgg 2220 ctaccaggca gccagcacgt ggtcagattc acggtttccgcatgtcattg tggtatgagc 2280 accttggcat gcttcatgac tccttcctcc agccagaaagtgatgaatgc attaacaagg 2340 tgaaccaagt tgctgacaaa tattgggatc tgtattctaacgagtcactt gagcatgacc 2400 ttcccggtca ccttctccgc taccccatcg gggttgccagtgaaggagat gtcaccgagc 2460 tgccaggatt tgaattcttt cccgacacca aggctcgcattcttggtggc aaagctgact 2520 acctcccccc tattctcact acttaatttg attttgtactttgttacaaa cttgttattg 2580 cttacatcct aatctgtgtt ttcattacta cttagtaataatcttaatat agtctgtgat 2640 gtcttgtgga gatgtttttt ctgctatttt tctgaagatttcagtgacag attttagtta 2700 tctgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 2736<210> SEQ ID NO 124 <211> LENGTH: 809 <212> TYPE: PRT <213> ORGANISM:Glycine max <400> SEQUENCE: 124 Met Ala Gln Ile Leu Leu His Gly Thr LeuHis Ala Thr Val Phe Glu 1 5 10 15 Val Asp Arg Leu Asn Ala Gly Gly GlyGly Gly Asn Phe Phe Ser Lys 20 25 30 Leu Lys Gln Asn Phe Glu Glu Thr ValGly Ile Gly Lys Gly Val Thr 35 40 45 Lys Leu Tyr Ala Thr Ile Asp Leu GluLys Ala Arg Val Gly Arg Thr 50 55 60 Arg Ile Ile Glu Asn Glu His Thr AsnPro Arg Trp Tyr Glu Ser Phe 65 70 75 80 His Ile Tyr Cys Ala His Met AlaSer Asn Ile Ile Phe Thr Val Lys 85 90 95 Asp Asp Asn Pro Ile Gly Ala ThrLeu Ile Gly Arg Ala Tyr Val Pro 100 105 110 Val Ser Glu Val Leu Asp GlyGlu Glu Ile Asp Arg Trp Val Glu Ile 115 120 125 Leu Asp Glu Glu Lys AsnPro Ile Gln Glu Gly Ser Lys Ile His Val 130 135 140 Lys Leu Gln Tyr PheAsp Val Thr Lys Asp Arg Asn Arg Ala Arg Gly 145 150 155 160 Ile Arg SerPro Lys Phe Pro Gly Val Pro Tyr Thr Phe Phe Ser Gln 165 170 175 Arg GlnGly Cys Lys Val Ser Leu Tyr Gln Asp Ala His Val Pro Asp 180 185 190 AsnPhe Val Pro Lys Ile Pro Leu Ala Gly Gly Lys Asn Tyr Glu Ala 195 200 205His Arg Cys Trp Glu Asp Ile Phe Asp Ala Ile Thr Asn Ala Arg His 210 215220 Phe Ile Tyr Ile Thr Gly Trp Ser Val Tyr Thr Glu Ile Ser Leu Val 225230 235 240 Arg Asp Ser Arg Arg Pro Lys Pro Gly Gly Asp Gln Thr Leu GlyGlu 245 250 255 Leu Leu Lys Lys Lys Ala Asn Glu Gly Val Lys Val Leu MetLeu Val 260 265 270 Trp Asp Asp Arg Thr Ser Val Gly Leu Leu Lys Lys AspGly Leu Met 275 280 285 Ala Thr His Asp Glu Glu Thr Ala Gln Phe Phe GluGly Thr Glu Val 290 295 300 His Cys Val Leu Cys Pro Arg Asn Pro Asp AspGly Gly Ser Ile Val 305 310 315 320 Gln Asp Leu Gln Ile Ser Thr Met PheThr His His Gln Lys Ile Val 325 330 335 Val Val Asp Gly Ala Met Pro GlyGlu Gly Ser Asp Arg Arg Arg Ile 340 345 350 Val Ser Phe Val Gly Gly IleAsp Leu Cys Asp Gly Arg Tyr Asp Thr 355 360 365 Ala Phe His Ser Leu PheArg Thr Leu Asp Thr Ala His His Asp Asp 370 375 380 Phe His Gln Pro AsnPhe Pro Gly Ala Ala Ile Thr Lys Gly Gly Pro 385 390 395 400 Arg Glu ProTrp His Asp Ile His Ser Arg Leu Glu Gly Pro Ile Ala 405 410 415 Trp AspVal Leu Phe Asn Phe Glu Gln Arg Trp Arg Lys Gln Gly Gly 420 425 430 LysAsp Val Leu Val Pro Leu Arg Glu Leu Glu Asp Val Ile Ile Pro 435 440 445Pro Ser Pro Val Thr Phe Pro Glu Asp His Glu Thr Trp Asn Val Gln 450 455460 Leu Phe Arg Ser Ile Asp Gly Gly Ala Ala Phe Gly Phe Pro Glu Thr 465470 475 480 Pro Glu Asp Ala Ala Arg Ala Gly Leu Ile Ser Gly Lys Asp AsnIle 485 490 495 Ile Asp Arg Ser Ile Gln Asp Ala Tyr Ile Asn Ala Ile ArgArg Ala 500 505 510 Lys Asn Phe Ile Tyr Ile Glu Asn Gln Tyr Phe Leu GlySer Ser Phe 515 520 525 Ala Trp Ser Ala Asp Asp Ile Lys Pro Glu Asp IleGly Ala Leu His 530 535 540 Leu Ile Pro Lys Glu Leu Ser Leu Lys Ile ValSer Lys Ile Glu Ala 545 550 555 560 Gly Glu Arg Phe Ala Val Tyr Val ValVal Pro Met Trp Pro Glu Gly 565 570 575 Val Pro Glu Ser Ala Ser Val GlnAla Ile Leu Asp Trp Gln Lys Arg 580 585 590 Thr Met Glu Met Met Tyr LysAsp Ile Ile Gln Ala Leu Arg Ala Lys 595 600 605 Gly Ile Asp Glu Asp ProArg Asn Tyr Leu Thr Phe Phe Cys Leu Gly 610 615 620 Asn Arg Glu Val LysLys Pro Gly Glu Tyr Glu Pro Ser Glu Gln Pro 625 630 635 640 Asp Pro AspSer Asp Tyr Gln Arg Ala Gln Glu Ala Arg Arg Phe Met 645 650 655 Ile TyrVal His Thr Lys Met Met Ile Val Asp Asp Glu Tyr Ile Ile 660 665 670 ValGly Ser Ala Asn Ile Asn Gln Arg Ser Met Asp Gly Ala Arg Asp 675 680 685Ser Glu Ile Ala Met Gly Ala Tyr Gln Pro Tyr His Leu Ala Thr Arg 690 695700 Gln Pro Ala Arg Gly Gln Ile His Gly Phe Arg Met Ser Leu Trp Tyr 705710 715 720 Glu His Leu Gly Met Leu His Asp Ser Phe Leu Gln Pro Glu SerAsp 725 730 735 Glu Cys Ile Asn Lys Val Asn Gln Val Ala Asp Lys Tyr TrpAsp Leu 740 745 750 Tyr Ser Asn Glu Ser Leu Glu His Asp Leu Pro Gly HisLeu Leu Arg 755 760 765 Tyr Pro Ile Gly Val Ala Ser Glu Gly Asp Val ThrGlu Leu Pro Gly 770 775 780 Phe Glu Phe Phe Pro Asp Thr Lys Ala Arg IleLeu Gly Gly Lys Ala 785 790 795 800 Asp Tyr Leu Pro Pro Ile Leu Thr Thr805 <210> SEQ ID NO 125 <211> LENGTH: 2997 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 125 gcacgagatc tgcctccctcccgattcatc gtctcccgtc ccctctttgt cgtcctcttc 60 accagcggca ggcggtggtggcaggtgact gtgactagca ggagcgcgga ggaggggtcc 120 ttcggtcgcc atggctcagatcttgctcca tgggaacctc cacgtcacca tcttcgaggc 180 ctcctcgctc tcccaccccggccgcgccag cggcggcgcc cccaagttca tccgcaagtt 240 tgtagagggc attgaggaaactgtcggtgt tggcaaagga agctccaagc tatatgccac 300 cattgatctc gagaaagctcgtgttgggcg taccaggatg ttgggcaacg agcccgtcaa 360 tcctcgctgg tacgagtcgttccacatcta ctgtgcgcac cttgccgccg atgtgatctt 420 cacgctgaag gccgacaacgcgatcggggc gacgctcatt gggagggcgt acctgcctgt 480 cggagagctc ctggaaggggaggagatcga taggtggctt gaaatctgtg atgacaaccg 540 ggagcctgtt ggtgagagcaagatccatgt gaagcttcag tactttggcg ttgagaagga 600 ccgcaactgg gcgaggggtgtccggagcgt caagtttcct ggtgttcctt acaccttctt 660 ctcgcagagg caaggatgcaatgttagatt gtaccaagat gctcatgtcc cagacaactt 720 tatccccaag attccgcttgcggacggcaa gaactatgag cctgccagat gttgggagga 780 tatctttgat gccataagcaatgctcagca tttgatttac atcactggtt ggtctgtgca 840 cactgagatc accttgattagggacaccaa tcgccccaaa cctggaggag acgtcactct 900 cggagagtta ctcaagaggaaggccagcga aggtgtccgg gtccttatgc tagtgtggga 960 tgatagaact tcagttggcttgctgaagag agatggcttg atggccaccc atgatgagga 1020 gactgcaaat tacttccaaggcaccgatgt gcactgtgtt ctgtgccctc gtaaccccga 1080 tgattcaggc agcattgttcaggatctgca gatctcaacc atgttcactc accatcagaa 1140 gatagtatgt gttgacgatgcattgccaag ccagggctcc gagcaaagga ggatactcag 1200 cttcgttggt ggcattgacctctgcgacgg aagatatgac actcagtacc actccttgtt 1260 taggacactt gacactgtccaccatgatga cttccaccag cctaacttcg cgactgcatc 1320 catcaccaaa ggtggcccaagagagccatg gcatgatatt cattcacgat tggaaggtcc 1380 aattgcctgg gatgttctttacaattttga gcagagatgg agaaagcagg gtggcaaaga 1440 tcttctcgtg cagctcagggatctctctga cataattatc cccccttctc ccgtcatgtt 1500 cccagaggac agagatacatggaatgtcca gctcttcaga tctattgatg gtggtgctgc 1560 ttttggcttc cctgatactcctgaggaagc tgcaagggct gggcttgtaa gtggaaagga 1620 tcaaatcatt gacaggagcatccaggatgc atacataaat gccatccgac gggcaaagga 1680 cttcatctac attgagaaccaatacttcct tggaagttcc tactgctgga agcccgaagg 1740 catcaagcct gaagaaattggcgctctgca tgtgattcct aaggagcttt cgttgaagat 1800 tgtcagcaag attgaagccggagaacggtt tactgtttat gttgtggtgc caatgtggcc 1860 tgagggcatg ccagagagcgcatctgtaca agcaattctg gactggcaaa ggagaacaat 1920 ggagatgatg tacactgacatcacacaagc tctcgaggca aaggggattg aagcaaaccc 1980 caaggaatac ctcactttcttctgcctagg taaccgtgag gtgaagcagg atggggaata 2040 tgaaccccag gagcagccagaacctgatac tgattacgtc cgcgctcaag aggctaggag 2100 gttcatgatc tacgttcataccaaaatgat gatagttgat gacgagtaca tcatcattgg 2160 gtctgcaaac atcaaccaacggtcaatgga cggcgcccgg gactccgaga ttgccatggg 2220 cgcttaccag ccataccatctagccaacag ggagccggcc cggggccaga tccacggctt 2280 ccggatggca ctgtggtacgagcacctggg catgctggac gacgtgttcc agcgcccgga 2340 gagcgtcgag tgcgtgcagaaggtgaacag gatcgcggag aagtactggg acatatactc 2400 gagcgacgac ctggagcaggacctccccgg ccacctgctg agctacccca tcggcgtcgc 2460 cagcgacggc gtggtgacggagctgccggg catggagttc ttccccgaca cccgggcccg 2520 catcctcggc gccaagtcggactaccttcc ccccatcctc accacataga taagatctca 2580 tcctgcattt cctgtgtgtggcttcagttt ggtgattcag aacttgtgtt tcagaaaata 2640 gcaggctgtt atagttgcgggactgttaat aagcgcagtg gtgtgcatgg ttgagaacca 2700 cggtagtgac taggaggattgctgatactt tacacggttt ctgctgtttg tatcactgtt 2760 taattataaa ttgtcaatgtttagttgttt acgctcgtca tacctacttg gatctttaag 2820 ggtggacaat aaagttaacgcgtgtttctt attgtatgat ggatatatag atagttttgt 2880 ttttccaatg tatggtatcagagtgctgta tgtttggttt ttccactgaa taattaagct 2940 gtattaggct ggtcaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 2997 <210> SEQ ID NO 126 <211>LENGTH: 812 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400>SEQUENCE: 126 Met Ala Gln Ile Leu Leu His Gly Asn Leu His Val Thr IlePhe Glu 1 5 10 15 Ala Ser Ser Leu Ser His Pro Gly Arg Ala Ser Gly GlyAla Pro Lys 20 25 30 Phe Ile Arg Lys Phe Val Glu Gly Ile Glu Glu Thr ValGly Val Gly 35 40 45 Lys Gly Ser Ser Lys Leu Tyr Ala Thr Ile Asp Leu GluLys Ala Arg 50 55 60 Val Gly Arg Thr Arg Met Leu Gly Asn Glu Pro Val AsnPro Arg Trp 65 70 75 80 Tyr Glu Ser Phe His Ile Tyr Cys Ala His Leu AlaAla Asp Val Ile 85 90 95 Phe Thr Leu Lys Ala Asp Asn Ala Ile Gly Ala ThrLeu Ile Gly Arg 100 105 110 Ala Tyr Leu Pro Val Gly Glu Leu Leu Glu GlyGlu Glu Ile Asp Arg 115 120 125 Trp Leu Glu Ile Cys Asp Asp Asn Arg GluPro Val Gly Glu Ser Lys 130 135 140 Ile His Val Lys Leu Gln Tyr Phe GlyVal Glu Lys Asp Arg Asn Trp 145 150 155 160 Ala Arg Gly Val Arg Ser ValLys Phe Pro Gly Val Pro Tyr Thr Phe 165 170 175 Phe Ser Gln Arg Gln GlyCys Asn Val Arg Leu Tyr Gln Asp Ala His 180 185 190 Val Pro Asp Asn PheIle Pro Lys Ile Pro Leu Ala Asp Gly Lys Asn 195 200 205 Tyr Glu Pro AlaArg Cys Trp Glu Asp Ile Phe Asp Ala Ile Ser Asn 210 215 220 Ala Gln HisLeu Ile Tyr Ile Thr Gly Trp Ser Val His Thr Glu Ile 225 230 235 240 ThrLeu Ile Arg Asp Thr Asn Arg Pro Lys Pro Gly Gly Asp Val Thr 245 250 255Leu Gly Glu Leu Leu Lys Arg Lys Ala Ser Glu Gly Val Arg Val Leu 260 265270 Met Leu Val Trp Asp Asp Arg Thr Ser Val Gly Leu Leu Lys Arg Asp 275280 285 Gly Leu Met Ala Thr His Asp Glu Glu Thr Ala Asn Tyr Phe Gln Gly290 295 300 Thr Asp Val His Cys Val Leu Cys Pro Arg Asn Pro Asp Asp SerGly 305 310 315 320 Ser Ile Val Gln Asp Leu Gln Ile Ser Thr Met Phe ThrHis His Gln 325 330 335 Lys Ile Val Cys Val Asp Asp Ala Leu Pro Ser GlnGly Ser Glu Gln 340 345 350 Arg Arg Ile Leu Ser Phe Val Gly Gly Ile AspLeu Cys Asp Gly Arg 355 360 365 Tyr Asp Thr Gln Tyr His Ser Leu Phe ArgThr Leu Asp Thr Val His 370 375 380 His Asp Asp Phe His Gln Pro Asn PheAla Thr Ala Ser Ile Thr Lys 385 390 395 400 Gly Gly Pro Arg Glu Pro TrpHis Asp Ile His Ser Arg Leu Glu Gly 405 410 415 Pro Ile Ala Trp Asp ValLeu Tyr Asn Phe Glu Gln Arg Trp Arg Lys 420 425 430 Gln Gly Gly Lys AspLeu Leu Val Gln Leu Arg Asp Leu Ser Asp Ile 435 440 445 Ile Ile Pro ProSer Pro Val Met Phe Pro Glu Asp Arg Asp Thr Trp 450 455 460 Asn Val GlnLeu Phe Arg Ser Ile Asp Gly Gly Ala Ala Phe Gly Phe 465 470 475 480 ProAsp Thr Pro Glu Glu Ala Ala Arg Ala Gly Leu Val Ser Gly Lys 485 490 495Asp Gln Ile Ile Asp Arg Ser Ile Gln Asp Ala Tyr Ile Asn Ala Ile 500 505510 Arg Arg Ala Lys Asp Phe Ile Tyr Ile Glu Asn Gln Tyr Phe Leu Gly 515520 525 Ser Ser Tyr Cys Trp Lys Pro Glu Gly Ile Lys Pro Glu Glu Ile Gly530 535 540 Ala Leu His Val Ile Pro Lys Glu Leu Ser Leu Lys Ile Val SerLys 545 550 555 560 Ile Glu Ala Gly Glu Arg Phe Thr Val Tyr Val Val ValPro Met Trp 565 570 575 Pro Glu Gly Met Pro Glu Ser Ala Ser Val Gln AlaIle Leu Asp Trp 580 585 590 Gln Arg Arg Thr Met Glu Met Met Tyr Thr AspIle Thr Gln Ala Leu 595 600 605 Glu Ala Lys Gly Ile Glu Ala Asn Pro LysGlu Tyr Leu Thr Phe Phe 610 615 620 Cys Leu Gly Asn Arg Glu Val Lys GlnAsp Gly Glu Tyr Glu Pro Gln 625 630 635 640 Glu Gln Pro Glu Pro Asp ThrAsp Tyr Val Arg Ala Gln Glu Ala Arg 645 650 655 Arg Phe Met Ile Tyr ValHis Thr Lys Met Met Ile Val Asp Asp Glu 660 665 670 Tyr Ile Ile Ile GlySer Ala Asn Ile Asn Gln Arg Ser Met Asp Gly 675 680 685 Ala Arg Asp SerGlu Ile Ala Met Gly Ala Tyr Gln Pro Tyr His Leu 690 695 700 Ala Asn ArgGlu Pro Ala Arg Gly Gln Ile His Gly Phe Arg Met Ala 705 710 715 720 LeuTrp Tyr Glu His Leu Gly Met Leu Asp Asp Val Phe Gln Arg Pro 725 730 735Glu Ser Val Glu Cys Val Gln Lys Val Asn Arg Ile Ala Glu Lys Tyr 740 745750 Trp Asp Ile Tyr Ser Ser Asp Asp Leu Glu Gln Asp Leu Pro Gly His 755760 765 Leu Leu Ser Tyr Pro Ile Gly Val Ala Ser Asp Gly Val Val Thr Glu770 775 780 Leu Pro Gly Met Glu Phe Phe Pro Asp Thr Arg Ala Arg Ile LeuGly 785 790 795 800 Ala Lys Ser Asp Tyr Leu Pro Pro Ile Leu Thr Thr 805810 <210> SEQ ID NO 127 <211> LENGTH: 387 <212> TYPE: DNA <213>ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION:(289) <221> NAME/KEY: unsure <222> LOCATION: (291) <221> NAME/KEY:unsure <222> LOCATION: (307) <221> NAME/KEY: unsure <222> LOCATION:(311) <221> NAME/KEY: unsure <222> LOCATION: (385) <400> SEQUENCE: 127gcggacgcgt gggcgatcaa ttgattcgaa ttctgtcaag ggttttccaa aagatccacg 60gaaggccact agtaagaatc ttgtttgtgg gaaaaatgta ctgattgata tgagcgtgca 120tacagcatat gtgaatgcca tccgaggtgc tcaacatttc atctatattg agaaccagta 180cttccttggc tcttcattta actgggattc acataaagat gttggtgcta ataatctgat 240acccattgag atagcactga aaattgcaaa caagatttat tcgaatgana nattttcagc 300ttatatngta nttcccatgt ggcctgaagg aaatcctacc agtactccta ccccaaagat 360ctttattgat tttccggttc aggangt 387 <210> SEQ ID NO 128 <211> LENGTH: 116<212> TYPE: PRT <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY:UNSURE <222> LOCATION: (92)..(93) <221> NAME/KEY: UNSURE <222> LOCATION:(98) <221> NAME/KEY: UNSURE <222> LOCATION: (100) <400> SEQUENCE: 128Arg Ser Ile Asp Ser Asn Ser Val Lys Gly Phe Pro Lys Asp Pro Arg 1 5 1015 Lys Ala Thr Ser Lys Asn Leu Val Cys Gly Lys Asn Val Leu Ile Asp 20 2530 Met Ser Val His Thr Ala Tyr Val Asn Ala Ile Arg Gly Ala Gln His 35 4045 Phe Ile Tyr Ile Glu Asn Gln Tyr Phe Leu Gly Ser Ser Phe Asn Trp 50 5560 Asp Ser His Lys Asp Val Gly Ala Asn Asn Leu Ile Pro Ile Glu Ile 65 7075 80 Ala Leu Lys Ile Ala Asn Lys Ile Tyr Ser Asn Xaa Xaa Phe Ser Ala 8590 95 Tyr Xaa Val Xaa Pro Met Trp Pro Glu Gly Asn Pro Thr Ser Thr Pro100 105 110 Thr Pro Lys Ile 115 <210> SEQ ID NO 129 <211> LENGTH: 556<212> TYPE: DNA <213> ORGANISM: Glycine max <220> FEATURE: <221>NAME/KEY: unsure <222> LOCATION: (389) <221> NAME/KEY: unsure <222>LOCATION: (427) <221> NAME/KEY: unsure <222> LOCATION: (455) <221>NAME/KEY: unsure <222> LOCATION: (457) <221> NAME/KEY: unsure <222>LOCATION: (465) <221> NAME/KEY: unsure <222> LOCATION: (482) <221>NAME/KEY: unsure <222> LOCATION: (489) <221> NAME/KEY: unsure <222>LOCATION: (518) <221> NAME/KEY: unsure <222> LOCATION: (520) <400>SEQUENCE: 129 acaccatgga caggctcagc aaattgttcc atttcaaacc acttcgagctccttaaggat 60 cctgctctta catggcaact tagaaatatg ggtcaacgag gccagaaaccttcccaacat 120 ggacatgttc cacaagaaaa cgggagaaat ggtttccatg ttgtcccgaaaacttggcgg 180 caaaatcgaa ggtcacatgt ccaaagctgg aaccagtgat ccctatgttacggtatctgt 240 ggccggtgct gtgattgcca gaacttttgt catcagaaac agtgagaaccctgtttggac 300 acagcatttc aatgtccccg ttgcacatct tgcttctgaa gttcactttgttgtcaagga 360 cagtgatatt gtggggttct caagattant gggagcagtt gggaattccagtgggaacat 420 ttagtantgg gacaagagtt gagggctttt tcccnancct tgggngctaatgggaaacca 480 anttaaggnt ggttaaggtt gagtttacca atccaagnan cccctgtttgaaaagggggc 540 ccctttaaac ccatgg 556 <210> SEQ ID NO 130 <211> LENGTH:150 <212> TYPE: PRT <213> ORGANISM: Glycine max <220> FEATURE: <221>NAME/KEY: UNSURE <222> LOCATION: (129) <221> NAME/KEY: UNSURE <222>LOCATION: (140)..(141) <221> NAME/KEY: UNSURE <222> LOCATION: (146)<400> SEQUENCE: 130 His Gly Gln Ala Gln Gln Ile Val Pro Phe Gln Thr ThrSer Ser Ser 1 5 10 15 Leu Arg Ile Leu Leu Leu His Gly Asn Leu Glu IleTrp Val Asn Glu 20 25 30 Ala Arg Asn Leu Pro Asn Met Asp Met Phe His LysLys Thr Gly Glu 35 40 45 Met Val Ser Met Leu Ser Arg Lys Leu Gly Gly LysIle Glu Gly His 50 55 60 Met Ser Lys Ala Gly Thr Ser Asp Pro Tyr Val ThrVal Ser Val Ala 65 70 75 80 Gly Ala Val Ile Ala Arg Thr Phe Val Ile ArgAsn Ser Glu Asn Pro 85 90 95 Val Trp Thr Gln His Phe Asn Val Pro Val AlaHis Leu Ala Ser Glu 100 105 110 Val His Phe Val Val Lys Asp Ser Asp IleVal Gly Phe Ser Arg Leu 115 120 125 Xaa Gly Ala Val Gly Asn Ser Ser GlyAsn Ile Xaa Xaa Trp Asp Lys 130 135 140 Ser Xaa Gly Leu Phe Pro 145 150<210> SEQ ID NO 131 <211> LENGTH: 866 <212> TYPE: DNA <213> ORGANISM:Zea mays <400> SEQUENCE: 131 ccacgcgtcc gcggacgcgt gggcgatcaa ttgattcgaattctgtcaag ggttttccaa 60 aagatccacg gaaggccact agtaagaatc ttgtttgtgggaaaaatgta ctgattgata 120 tgagcgtgca tacagcatat gtgaatgcca tccgaggtgctcaacatttc atctatattg 180 agaaccagta cttccttggc tcttcattta actgggattcacatagagat gttggtgcta 240 ataatctgat acccattgag atagcactga aaattgcaaacaagatttat tcgaatgaga 300 gattttcagc ttatatagta gttccaatgt ggcctgagggaaatcctacc agtactccta 360 cgcaaaggat tctttattga ttttcgggta caggatgtctctatgggtag agcaaataag 420 ggggacggtg cactcccccc ttcgtgtgtg ttttaatcaccatttctaaa atttattcta 480 ttgaagtgat gatgacagtc catcgccgcc aagaggtcacaggagaagaa ctcgcagccc 540 aagtcctcgt ggtagttatg gaggccatgg caagcgccccacaactaatc ttatggtgac 600 gaatcttggt ccggattgta ggattccgac tctcatcaagtgactcctcc cctcacgatg 660 actttgcgga cagatgtctc aattatagtg gtggagctacgaacaacaat ccgatgtcct 720 gatcgagagc gatggagcta atgtttgaac ttgtgtgtttgaccttgtga actaaaaatg 780 tgaactatgg atgtgaactt atatgaattt gtgaacttatgtatttggac ttatgtgaat 840 ttgtgaactt aaaaaaaaaa aaaaag 866 <210> SEQ IDNO 132 <211> LENGTH: 125 <212> TYPE: PRT <213> ORGANISM: Zea mays <400>SEQUENCE: 132 Thr Arg Pro Arg Thr Arg Gly Arg Ser Ile Asp Ser Asn SerVal Lys 1 5 10 15 Gly Phe Pro Lys Asp Pro Arg Lys Ala Thr Ser Lys AsnLeu Val Cys 20 25 30 Gly Lys Asn Val Leu Ile Asp Met Ser Val His Thr AlaTyr Val Asn 35 40 45 Ala Ile Arg Gly Ala Gln His Phe Ile Tyr Ile Glu AsnGln Tyr Phe 50 55 60 Leu Gly Ser Ser Phe Asn Trp Asp Ser His Arg Asp ValGly Ala Asn 65 70 75 80 Asn Leu Ile Pro Ile Glu Ile Ala Leu Lys Ile AlaAsn Lys Ile Tyr 85 90 95 Ser Asn Glu Arg Phe Ser Ala Tyr Ile Val Val ProMet Trp Pro Glu 100 105 110 Gly Asn Pro Thr Ser Thr Pro Thr Gln Arg IleLeu Tyr 115 120 125 <210> SEQ ID NO 133 <211> LENGTH: 2797 <212> TYPE:DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 133 gcacgagacaccatggacag gctcagcaaa ttgttccatt tcaaaccact tcgagctcct 60 taaggatcctgctcttacat ggcaacttag aaatatgggt caacgaggcc agaaaccttc 120 ccaacatggacatgttccac aagaaaacgg gagaaatggt ttccatgttg tcccgaaaac 180 ttggcggcaaaatcgaaggt cacatgtcca aagctggaac cagtgatccc tatgttacgg 240 tatctgtggccggtgctgtg attgccagaa cttttgtcat cagaaacagt gagaaccctg 300 tttggacacagcatttcaat gtccccgttg cacatcttgc ttctgaagtt cactttgttg 360 tcaaggacagtgatattgtg ggttctcaga ttattggagc agtgggaatt ccagtggaac 420 atttatgtagtggaacaaga gttgagggct ttttccccat ccttggtgct aatgggaaac 480 catgtaagggtggatcagtg ttgagtttat ccattcagta cacccctgtt gaaaaggtgc 540 ctctttatagccatggagta ggtgctggtc ctgattatga aggggtccct ggcacctact 600 ttccccttagaaaaggtgga aaagttacac tttatcagga tgctcatgtt gaagaaggtt 660 gccttcccagtttgaaggtg gatggatatg tgaattacaa gcatggaagt tgttggcatg 720 acatatttgatgcaataagt gaggctcgtc gattggttta cattgtgggg tggtctgtgt 780 actacaatgttagtctcatt cgggatagtg ctaatggaaa atcctacact ttaggtgatc 840 ttctcaaagccaaatcacag gaaggtgtga gagtgctgct ccttgtttgg gatgatccca 900 cgtctaaaagcatgcttgga tttaaaacgg ttggactcat gaacactcat gatgaggaca 960 ctcgccagtttttcaagaac tcttcagtac gagtgcttct ttgcccacga gctggtggaa 1020 aaggacatagctgggtcaaa acgcaggaag ctggaacaat ctatacccat catcagaaga 1080 cagtcattgtggatgctgat gcaggtcaga ataaaagaaa aatcaaagct ttcatcggag 1140 gtcttgatttatgtgtgggc cgatatgata ccccaaacca ttccatcttt aggactttgc 1200 agacaacacacaaagatgac tatcataatc ctaactttga ggggccagtt actggttgtc 1260 caagacaaccatggcatgat ttgcattccc aagttgatgg tccagcagca tatgacattc 1320 tcaccaattttgaggagcgt tggttaaggg cattaaaaat gcatagattt caaaagatga 1380 aaagttcacatgatgattca ttactgaaaa ttgatagaat ccctgacatt gttggcattg 1440 atgaagttccttgccagaat gaaaataacc gggagacttg gcatgcccag gtcttccgtt 1500 caattgattctaattctgtg aaaggatttc caaaggaacc acaagatgct ataagaagga 1560 acttggtttgtggaaagaat gtactgatag acatgagcat acattcagcg tatgtcaagg 1620 caattcgagcagcccaaaag tttatctata ttgagaacca atactttctt ggctcgtcat 1680 ataattgggattcttacaaa gaccttggtg caaacaactt aattccaatg gaaattgcat 1740 taaaaatagccaataaaatc aaacaacatg agagattttc tgtgtatatt gtcattccta 1800 tgtggcctgaaggtgtacct acaagtacag ctactcagag gattctcttt tggcagttca 1860 aaacaatgcaaatgatgtat gaaacaattt acaaggccct acaggaggct gggcttgaca 1920 ataagtatgaaccacaggac tacttgaatt tcttttgcct tggcaatcgt gagatacctg 1980 acaatgaaaatgttttaaat gatgtaaaaa ctactggaga aaacaagcct caggcactca 2040 ctaaaaagaaccggagattc atgatttatg ttcattcaaa aggaatgata gtggatgatg 2100 aatatgtgttactggggtct gcaaacataa accagcgatc catggaaggc accagagata 2160 cagagatagcaatgggggca tatcagccta atcatacttg ggcaaagaag caatctaaac 2220 ctcacggacaggtgcatggt tatagaatgt cactatggag cgaacatata ggagccgtgg 2280 aagaatgttttgaggaacca gagagccttg aatgtgtaag acggataagg tcattgagtg 2340 agtttaactggagacaatat gcagcagaag aggtaactga aatgaaaagt catctattaa 2400 aatatccgcttgaagttgat tcaaagggca aagtgaagcc tctttttggc tgtgaggcat 2460 tcccagacgttggtgggaac ataagtggca ctttcacact actcaaagaa aatctcacca 2520 tctgatcatttacgtgagtt ctctaactca gtgaatacta tagcagattt tagtagctta 2580 tttacaatatttagattctt ttggaaaaga aaagaaaagg agtataaaga tttgaacttg 2640 taacctgctttgaagtacta gtttactcaa tgtcttattt aagatttaag gtagtcttaa 2700 aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2760 aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaa 2797 <210> SEQ ID NO 134 <211> LENGTH: 840<212> TYPE: PRT <213> ORGANISM: Glycine max <400> SEQUENCE: 134 Thr ArgHis His Gly Gln Ala Gln Gln Ile Val Pro Phe Gln Thr Thr 1 5 10 15 SerSer Ser Leu Arg Ile Leu Leu Leu His Gly Asn Leu Glu Ile Trp 20 25 30 ValAsn Glu Ala Arg Asn Leu Pro Asn Met Asp Met Phe His Lys Lys 35 40 45 ThrGly Glu Met Val Ser Met Leu Ser Arg Lys Leu Gly Gly Lys Ile 50 55 60 GluGly His Met Ser Lys Ala Gly Thr Ser Asp Pro Tyr Val Thr Val 65 70 75 80Ser Val Ala Gly Ala Val Ile Ala Arg Thr Phe Val Ile Arg Asn Ser 85 90 95Glu Asn Pro Val Trp Thr Gln His Phe Asn Val Pro Val Ala His Leu 100 105110 Ala Ser Glu Val His Phe Val Val Lys Asp Ser Asp Ile Val Gly Ser 115120 125 Gln Ile Ile Gly Ala Val Gly Ile Pro Val Glu His Leu Cys Ser Gly130 135 140 Thr Arg Val Glu Gly Phe Phe Pro Ile Leu Gly Ala Asn Gly LysPro 145 150 155 160 Cys Lys Gly Gly Ser Val Leu Ser Leu Ser Ile Gln TyrThr Pro Val 165 170 175 Glu Lys Val Pro Leu Tyr Ser His Gly Val Gly AlaGly Pro Asp Tyr 180 185 190 Glu Gly Val Pro Gly Thr Tyr Phe Pro Leu ArgLys Gly Gly Lys Val 195 200 205 Thr Leu Tyr Gln Asp Ala His Val Glu GluGly Cys Leu Pro Ser Leu 210 215 220 Lys Val Asp Gly Tyr Val Asn Tyr LysHis Gly Ser Cys Trp His Asp 225 230 235 240 Ile Phe Asp Ala Ile Ser GluAla Arg Arg Leu Val Tyr Ile Val Gly 245 250 255 Trp Ser Val Tyr Tyr AsnVal Ser Leu Ile Arg Asp Ser Ala Asn Gly 260 265 270 Lys Ser Tyr Thr LeuGly Asp Leu Leu Lys Ala Lys Ser Gln Glu Gly 275 280 285 Val Arg Val LeuLeu Leu Val Trp Asp Asp Pro Thr Ser Lys Ser Met 290 295 300 Leu Gly PheLys Thr Val Gly Leu Met Asn Thr His Asp Glu Asp Thr 305 310 315 320 ArgGln Phe Phe Lys Asn Ser Ser Val Arg Val Leu Leu Cys Pro Arg 325 330 335Ala Gly Gly Lys Gly His Ser Trp Val Lys Thr Gln Glu Ala Gly Thr 340 345350 Ile Tyr Thr His His Gln Lys Thr Val Ile Val Asp Ala Asp Ala Gly 355360 365 Gln Asn Lys Arg Lys Ile Lys Ala Phe Ile Gly Gly Leu Asp Leu Cys370 375 380 Val Gly Arg Tyr Asp Thr Pro Asn His Ser Ile Phe Arg Thr LeuGln 385 390 395 400 Thr Thr His Lys Asp Asp Tyr His Asn Pro Asn Phe GluGly Pro Val 405 410 415 Thr Gly Cys Pro Arg Gln Pro Trp His Asp Leu HisSer Gln Val Asp 420 425 430 Gly Pro Ala Ala Tyr Asp Ile Leu Thr Asn PheGlu Glu Arg Trp Leu 435 440 445 Arg Ala Leu Lys Met His Arg Phe Gln LysMet Lys Ser Ser His Asp 450 455 460 Asp Ser Leu Leu Lys Ile Asp Arg IlePro Asp Ile Val Gly Ile Asp 465 470 475 480 Glu Val Pro Cys Gln Asn GluAsn Asn Arg Glu Thr Trp His Ala Gln 485 490 495 Val Phe Arg Ser Ile AspSer Asn Ser Val Lys Gly Phe Pro Lys Glu 500 505 510 Pro Gln Asp Ala IleArg Arg Asn Leu Val Cys Gly Lys Asn Val Leu 515 520 525 Ile Asp Met SerIle His Ser Ala Tyr Val Lys Ala Ile Arg Ala Ala 530 535 540 Gln Lys PheIle Tyr Ile Glu Asn Gln Tyr Phe Leu Gly Ser Ser Tyr 545 550 555 560 AsnTrp Asp Ser Tyr Lys Asp Leu Gly Ala Asn Asn Leu Ile Pro Met 565 570 575Glu Ile Ala Leu Lys Ile Ala Asn Lys Ile Lys Gln His Glu Arg Phe 580 585590 Ser Val Tyr Ile Val Ile Pro Met Trp Pro Glu Gly Val Pro Thr Ser 595600 605 Thr Ala Thr Gln Arg Ile Leu Phe Trp Gln Phe Lys Thr Met Gln Met610 615 620 Met Tyr Glu Thr Ile Tyr Lys Ala Leu Gln Glu Ala Gly Leu AspAsn 625 630 635 640 Lys Tyr Glu Pro Gln Asp Tyr Leu Asn Phe Phe Cys LeuGly Asn Arg 645 650 655 Glu Ile Pro Asp Asn Glu Asn Val Leu Asn Asp ValLys Thr Thr Gly 660 665 670 Glu Asn Lys Pro Gln Ala Leu Thr Lys Lys AsnArg Arg Phe Met Ile 675 680 685 Tyr Val His Ser Lys Gly Met Ile Val AspAsp Glu Tyr Val Leu Leu 690 695 700 Gly Ser Ala Asn Ile Asn Gln Arg SerMet Glu Gly Thr Arg Asp Thr 705 710 715 720 Glu Ile Ala Met Gly Ala TyrGln Pro Asn His Thr Trp Ala Lys Lys 725 730 735 Gln Ser Lys Pro His GlyGln Val His Gly Tyr Arg Met Ser Leu Trp 740 745 750 Ser Glu His Ile GlyAla Val Glu Glu Cys Phe Glu Glu Pro Glu Ser 755 760 765 Leu Glu Cys ValArg Arg Ile Arg Ser Leu Ser Glu Phe Asn Trp Arg 770 775 780 Gln Tyr AlaAla Glu Glu Val Thr Glu Met Lys Ser His Leu Leu Lys 785 790 795 800 TyrPro Leu Glu Val Asp Ser Lys Gly Lys Val Lys Pro Leu Phe Gly 805 810 815Cys Glu Ala Phe Pro Asp Val Gly Gly Asn Ile Ser Gly Thr Phe Thr 820 825830 Leu Leu Lys Glu Asn Leu Thr Ile 835 840 <210> SEQ ID NO 135 <211>LENGTH: 462 <212> TYPE: DNA <213> ORGANISM: Zea mays <220> FEATURE:<221> NAME/KEY: unsure <222> LOCATION: (4) <221> NAME/KEY: unsure <222>LOCATION: (31) <221> NAME/KEY: unsure <222> LOCATION: (105) <221>NAME/KEY: unsure <222> LOCATION: (412) <221> NAME/KEY: unsure <222>LOCATION: (436) <400> SEQUENCE: 135 cggnacgctg ggaggccacg ctcccgtgccngctgcctcg tctgctccgc gcccatcacc 60 aggctcatcc gggaatataa tgtacgagctaattctgtta tgganaagac atactctatt 120 ggacgatttg tcactggttt acctcctttctcgaaaaaga agaatgccga gaacaagtgg 180 tctctccata aggagggtct gcaaggccgtcagatagccg agaatatgcg ggaaaaatac 240 aacaagaaac catggatttt ggaagacgaaacaggccaat atcagtatca aggtcaaatg 300 gaaggatcac agtcagctac agctacatattatttgttaa tgaagcacgg caaggaattt 360 aatgcatatc ctgctggctc ttggtttaatttcagtaaaa ttgcacagta cnaacaattg 420 acactggagg aggctnaaga aaagatgaataagaggaaga cc 462 <210> SEQ ID NO 136 <211> LENGTH: 83 <212> TYPE: PRT<213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: UNSURE <222>LOCATION: (69) <221> NAME/KEY: UNSURE <222> LOCATION: (77) <400>SEQUENCE: 136 Arg Gln Ile Ala Glu Asn Met Arg Glu Lys Tyr Asn Lys LysPro Trp 1 5 10 15 Ile Leu Glu Asp Glu Thr Gly Gln Tyr Gln Tyr Gln GlyGln Met Glu 20 25 30 Gly Ser Gln Ser Ala Thr Ala Thr Tyr Tyr Leu Leu MetLys His Gly 35 40 45 Lys Glu Phe Asn Ala Tyr Pro Ala Gly Ser Trp Phe AsnPhe Ser Lys 50 55 60 Ile Ala Gln Tyr Xaa Gln Leu Thr Leu Glu Glu Ala XaaGlu Lys Met 65 70 75 80 Asn Lys Arg <210> SEQ ID NO 137 <211> LENGTH:1710 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 137ccacgcgtcc gcggacgcgt gggggcacgc tcccgtgccc gctgcctcgt ctgctccgcg 60cccatcacca ggctcatccg ggaatataat gtacgagcta attctgttat ggataagaca 120tactctattg gacgatttgt cactggttta cctcctttct cgaaaaagaa gaatgccgag 180aacaagtggt ctctccataa ggagggtctg caaggccgtc agatagccga gaatatgcgg 240gaaaaataca acaagaaacc atggattttg gaagacgaaa caggccaata tcagtatcaa 300ggtcaaatgg aaggatcaca gtcagctaca gctacatatt atttgttaat gaggcacggc 360aaggaattta atgcatatcc tgctggctct tggtttaatt tcagtaaaat tgcacagtac 420aaacaattga cactggagga ggctgaagaa aagatgaata agaggaagac cagtgcaact 480ggttatgaac gctggatgat gaaagcagcg gcaaatggac cagctgcctt tggttcagac 540atgaagaagc ttgaggccac aaatggtggg gaaaaagaaa gtgctcgtcc taagaaagga 600aaaaataatg aagagggtaa taattctgat aagggtgagg aggatgaaga agaagaagca 660acacgcaaga atagacttgg actaactaaa aagggcatgg atgatgatga ggaaggtgta 720aaaaatctag atttcgattt ggatgatgaa attgagaaag gtgatgactg ggagcatgaa 780gaaacattca ctgacgatga tgaggctgta gacattgatc cagaggaacg ggcagattta 840gctcctgaaa ttcctgctcc acctgaaatc aagcaggatg atgaagagaa cgaagaagaa 900gggggcctga gcaagtctgg caaggaacta aaaaagttgc ttgggcgcgc tgctggacta 960aatgagtcag atgcggatga ggatgaagaa gacgaagatc aagaagatga ttcatcgcca 1020gtgcttgctc caaaacagaa ggatcaagtt aaagatgaac ctgtggatag cagcccatct 1080aaaccagcac catcaggaca tgctcgaggc acacctccgg catccaaatc caagcaaaag 1140agaaaatcag gtgttgatga tgcaaaaact tctagtggtg cagcttcaaa gaaagcaaag 1200gtggaattgg atgcgaaagc atcaggcctc aaagaggagg catcatcttt ggcaaaacct 1260gcatcaaaga cctctgctgc atcaaaaagt gggacaagcg tatcacctgt cacagaggat 1320gaaatcagga gtgttcttct tgcagtggct ccagtcacca cacaagatct ggtatccaga 1380ttcaagtcta ggcttcgagg tgcagaggac aagaatgcat ttgctgaaat tctgaagaaa 1440atttcgaaga tacagaagac gaacggccac aactacgttg tcctcagaga tgacaagaag 1500taaataaccg aacaaaggcg ctgacatcgc aaccgttggc tgtactgaga taatatgttt 1560atgtaacatg ctgccttgaa gataaacaag tctgtaacac tagaattgtc ttctaaccct 1620tgtgtacttg aattttattt tactattcca gattttattt atgcggcttt gcactacaca 1680cctgttataa atggaaatcc attgggagtt 1710 <210> SEQ ID NO 138 <211> LENGTH:500 <212> TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 138 Pro ArgVal Arg Gly Arg Val Gly Ala Arg Ser Arg Ala Arg Cys Leu 1 5 10 15 ValCys Ser Ala Pro Ile Thr Arg Leu Ile Arg Glu Tyr Asn Val Arg 20 25 30 AlaAsn Ser Val Met Asp Lys Thr Tyr Ser Ile Gly Arg Phe Val Thr 35 40 45 GlyLeu Pro Pro Phe Ser Lys Lys Lys Asn Ala Glu Asn Lys Trp Ser 50 55 60 LeuHis Lys Glu Gly Leu Gln Gly Arg Gln Ile Ala Glu Asn Met Arg 65 70 75 80Glu Lys Tyr Asn Lys Lys Pro Trp Ile Leu Glu Asp Glu Thr Gly Gln 85 90 95Tyr Gln Tyr Gln Gly Gln Met Glu Gly Ser Gln Ser Ala Thr Ala Thr 100 105110 Tyr Tyr Leu Leu Met Arg His Gly Lys Glu Phe Asn Ala Tyr Pro Ala 115120 125 Gly Ser Trp Phe Asn Phe Ser Lys Ile Ala Gln Tyr Lys Gln Leu Thr130 135 140 Leu Glu Glu Ala Glu Glu Lys Met Asn Lys Arg Lys Thr Ser AlaThr 145 150 155 160 Gly Tyr Glu Arg Trp Met Met Lys Ala Ala Ala Asn GlyPro Ala Ala 165 170 175 Phe Gly Ser Asp Met Lys Lys Leu Glu Ala Thr AsnGly Gly Glu Lys 180 185 190 Glu Ser Ala Arg Pro Lys Lys Gly Lys Asn AsnGlu Glu Gly Asn Asn 195 200 205 Ser Asp Lys Gly Glu Glu Asp Glu Glu GluGlu Ala Thr Arg Lys Asn 210 215 220 Arg Leu Gly Leu Thr Lys Lys Gly MetAsp Asp Asp Glu Glu Gly Val 225 230 235 240 Lys Asn Leu Asp Phe Asp LeuAsp Asp Glu Ile Glu Lys Gly Asp Asp 245 250 255 Trp Glu His Glu Glu ThrPhe Thr Asp Asp Asp Glu Ala Val Asp Ile 260 265 270 Asp Pro Glu Glu ArgAla Asp Leu Ala Pro Glu Ile Pro Ala Pro Pro 275 280 285 Glu Ile Lys GlnAsp Asp Glu Glu Asn Glu Glu Glu Gly Gly Leu Ser 290 295 300 Lys Ser GlyLys Glu Leu Lys Lys Leu Leu Gly Arg Ala Ala Gly Leu 305 310 315 320 AsnGlu Ser Asp Ala Asp Glu Asp Glu Glu Asp Glu Asp Gln Glu Asp 325 330 335Asp Ser Ser Pro Val Leu Ala Pro Lys Gln Lys Asp Gln Val Lys Asp 340 345350 Glu Pro Val Asp Ser Ser Pro Ser Lys Pro Ala Pro Ser Gly His Ala 355360 365 Arg Gly Thr Pro Pro Ala Ser Lys Ser Lys Gln Lys Arg Lys Ser Gly370 375 380 Val Asp Asp Ala Lys Thr Ser Ser Gly Ala Ala Ser Lys Lys AlaLys 385 390 395 400 Val Glu Leu Asp Ala Lys Ala Ser Gly Leu Lys Glu GluAla Ser Ser 405 410 415 Leu Ala Lys Pro Ala Ser Lys Thr Ser Ala Ala SerLys Ser Gly Thr 420 425 430 Ser Val Ser Pro Val Thr Glu Asp Glu Ile ArgSer Val Leu Leu Ala 435 440 445 Val Ala Pro Val Thr Thr Gln Asp Leu ValSer Arg Phe Lys Ser Arg 450 455 460 Leu Arg Gly Ala Glu Asp Lys Asn AlaPhe Ala Glu Ile Leu Lys Lys 465 470 475 480 Ile Ser Lys Ile Gln Lys ThrAsn Gly His Asn Tyr Val Val Leu Arg 485 490 495 Asp Asp Lys Lys 500<210> SEQ ID NO 139 <211> LENGTH: 396 <212> TYPE: DNA <213> ORGANISM:Zea mays <400> SEQUENCE: 139 gaacaagcac cagcgaaacc atccgatgtc aaaagaactcgtagggatcg tacggagatg 60 gaaaacatta tattcaagct ttttgaaagg caacccaattgggcactaaa ggcgctggtg 120 caagaaactg accagccaga gcaatttctg aaggagatattgaacgatct ctgtgtgtat 180 aataaacgag gaccaaacca gggaactcat gagcttaagccagagtacaa gaaatccaca 240 ggggacactg atgctgctta aagatgatag acgacttggttgtttatgtg cagaatcgct 300 agttctctgt cggtaactcg gggcacacaa acacaagcgtgatctagttt atttgtatgc 360 tttaagaggt gttatttgta agctggatac acattt 396<210> SEQ ID NO 140 <211> LENGTH: 86 <212> TYPE: PRT <213> ORGANISM: Zeamays <400> SEQUENCE: 140 Glu Gln Ala Pro Ala Lys Pro Ser Asp Val Lys ArgThr Arg Arg Asp 1 5 10 15 Arg Thr Glu Met Glu Asn Ile Ile Phe Lys LeuPhe Glu Arg Gln Pro 20 25 30 Asn Trp Ala Leu Lys Ala Leu Val Gln Glu ThrAsp Gln Pro Glu Gln 35 40 45 Phe Leu Lys Glu Ile Leu Asn Asp Leu Cys ValTyr Asn Lys Arg Gly 50 55 60 Pro Asn Gln Gly Thr His Glu Leu Lys Pro GluTyr Lys Lys Ser Thr 65 70 75 80 Gly Asp Thr Asp Ala Ala 85 <210> SEQ IDNO 141 <211> LENGTH: 567 <212> TYPE: DNA <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (361)<221> NAME/KEY: unsure <222> LOCATION: (495) <221> NAME/KEY: unsure<222> LOCATION: (515) <221> NAME/KEY: unsure <222> LOCATION: (526) <221>NAME/KEY: unsure <222> LOCATION: (531) <221> NAME/KEY: unsure <222>LOCATION: (554) <400> SEQUENCE: 141 tagggatcgc cgggaactgg aaaacattatcttcaagctt ttcgaaaagc agcctaactg 60 ggcactgaag gctctagtgc aagagactgaccagccagag caattcctca aggagatatt 120 gaacgacctc tgtatgtaca acaaacgagggccaaaccag ggcacgcacg agctcaagcc 180 cgagtacaag aagtcgtctg aggacgctgccggtgccccg tgaagatgat ctatctgttt 240 tggcggctgt tgtgtaactg tgataccaggaacctgttgg cttgatgcgc ggaactatga 300 gttgtctatg tgaagcctcc agtcatgcttggggctgcat ttgcttgctt tttattaaga 360 ngcattgttg taattgtatg ggtataagatgatacatata caacacatgt cacagagagg 420 aaatggatct actgataaac acacgaaagattgtttgatt tctgggtaga tgtaagaatc 480 attcaaggat cgacntgtgc atcctgtcctttganaatcc gtactnaata natcaaattt 540 tggctaaaat atangaatca cctgatt 567<210> SEQ ID NO 142 <211> LENGTH: 68 <212> TYPE: PRT <213> ORGANISM:Triticum aestivum <400> SEQUENCE: 142 Arg Arg Glu Leu Glu Asn Ile IlePhe Lys Leu Phe Glu Lys Gln Pro 1 5 10 15 Asn Trp Ala Leu Lys Ala LeuVal Gln Glu Thr Asp Gln Pro Glu Gln 20 25 30 Phe Leu Lys Glu Ile Leu AsnAsp Leu Cys Met Tyr Asn Lys Arg Gly 35 40 45 Pro Asn Gln Gly Thr His GluLeu Lys Pro Glu Tyr Lys Lys Ser Ser 50 55 60 Glu Asp Ala Ala 65 <210>SEQ ID NO 143 <211> LENGTH: 813 <212> TYPE: DNA <213> ORGANISM: Zea mays<400> SEQUENCE: 143 agcccacgct ccagttcaag atggagttgg cacaaactaacactgggaat acacctaaga 60 gctactcttt gaatatgttc aaagatttcg tgcctatgtgtgttttctcc gaatctaacc 120 aagggaaact ttcatgcgaa ggaaaagtcg agcataaatttgacatggaa cctcacagtg 180 ataatttggc gaactatgga aagttatgtc gtgaaagaacacaaaaatat atggttaaat 240 ctagacaagt gcaggtactt gacaatgacc acggtatgagcatgagacca atgcctggct 300 tggttggtct cataccttct ggttcccacg cgtccgaacaagcaccagcg aaaccatccg 360 atgtcaaaag aactcgtagg gatcgtacgg agatggaaaacattatattc aagctttttg 420 aaaggcaacc caattgggca ctaaaggcgc tggtgcaagaaactgaccag ccagagcaat 480 ttctgaagga gatattgaac gatctctgtg tgtataataaacgaggacca aaccagggaa 540 ctcatgagct taagccagag tacaagaaat ccacaggggacactgatgct gcttaaagat 600 gatagacgac ttggttgttt atgtgcagaa tcgctagttctctgtcggta actcgggcac 660 acaaacacaa gcgtgatcta gttttatttg tatgctttaagaggtgttat ttgtaagctg 720 gatacacatt tgtttagaga ggaaaagggg aaagataattgaaacaaaat gtaagagtct 780 aaattgttgg accaatgttc tttatcgggt att 813 <210>SEQ ID NO 144 <211> LENGTH: 197 <212> TYPE: PRT <213> ORGANISM: Zea mays<400> SEQUENCE: 144 Pro Thr Leu Gln Phe Lys Met Glu Leu Ala Gln Thr AsnThr Gly Asn 1 5 10 15 Thr Pro Lys Ser Tyr Ser Leu Asn Met Phe Lys AspPhe Val Pro Met 20 25 30 Cys Val Phe Ser Glu Ser Asn Gln Gly Lys Leu SerCys Glu Gly Lys 35 40 45 Val Glu His Lys Phe Asp Met Glu Pro His Ser AspAsn Leu Ala Asn 50 55 60 Tyr Gly Lys Leu Cys Arg Glu Arg Thr Gln Lys TyrMet Val Lys Ser 65 70 75 80 Arg Gln Val Gln Val Leu Asp Asn Asp His GlyMet Ser Met Arg Pro 85 90 95 Met Pro Gly Leu Val Gly Leu Ile Pro Ser GlySer His Ala Ser Glu 100 105 110 Gln Ala Pro Ala Lys Pro Ser Asp Val LysArg Thr Arg Arg Asp Arg 115 120 125 Thr Glu Met Glu Asn Ile Ile Phe LysLeu Phe Glu Arg Gln Pro Asn 130 135 140 Trp Ala Leu Lys Ala Leu Val GlnGlu Thr Asp Gln Pro Glu Gln Phe 145 150 155 160 Leu Lys Glu Ile Leu AsnAsp Leu Cys Val Tyr Asn Lys Arg Gly Pro 165 170 175 Asn Gln Gly Thr HisGlu Leu Lys Pro Glu Tyr Lys Lys Ser Thr Gly 180 185 190 Asp Thr Asp AlaAla 195 <210> SEQ ID NO 145 <211> LENGTH: 677 <212> TYPE: DNA <213>ORGANISM: Oryza sativa <400> SEQUENCE: 145 gcacgagggt acttgcgaatgacaatggaa tgagcatgag gccgttgcct ggcttggtgg 60 gtctgatgtc ttctggtccaaaacagaagg agaagaagcc actaccagta aaaccatcag 120 acatgaaaag aacgagaagggatcgcaggg aactggaaaa tatcttattc aagctttttg 180 agaggcagcc gaattggtctcttaagaatc tcatgcaaga aactgatcaa ccagagcaat 240 tcttgaagga gatattgaatgatctgtgtt tctacaacaa aaggggtcca aatcaaggga 300 cgcatgaact taagcctgaatacaagaaat ctacagagga cgctgatgct actgctactt 360 agaagtctta tgcttcggtctactagatag gagtcctcgt ccagtgtgga ctccaatccg 420 ttacttgctg cgttcaaggaaatcttggat acttcgttta tagtttgttt cttgagaaac 480 atattttgta cgcatcaagcgatacatctt ttggtgttac tggcggccat atcttagacc 540 cagatttcgg ggactcgatatattcgtgtc catcagagtt tagaaacaag gatgttgaaa 600 tttgtgtgta taatgtattttagctcttct aggcaaacta acatgatgat tttcatgatc 660 aaaaaaaaaa aaaaaaa 677<210> SEQ ID NO 146 <211> LENGTH: 119 <212> TYPE: PRT <213> ORGANISM:Oryza sativa <400> SEQUENCE: 146 Thr Arg Val Leu Ala Asn Asp Asn Gly MetSer Met Arg Pro Leu Pro 1 5 10 15 Gly Leu Val Gly Leu Met Ser Ser GlyPro Lys Gln Lys Glu Lys Lys 20 25 30 Pro Leu Pro Val Lys Pro Ser Asp MetLys Arg Thr Arg Arg Asp Arg 35 40 45 Arg Glu Leu Glu Asn Ile Leu Phe LysLeu Phe Glu Arg Gln Pro Asn 50 55 60 Trp Ser Leu Lys Asn Leu Met Gln GluThr Asp Gln Pro Glu Gln Phe 65 70 75 80 Leu Lys Glu Ile Leu Asn Asp LeuCys Phe Tyr Asn Lys Arg Gly Pro 85 90 95 Asn Gln Gly Thr His Glu Leu LysPro Glu Tyr Lys Lys Ser Thr Glu 100 105 110 Asp Ala Asp Ala Thr Ala Thr115 <210> SEQ ID NO 147 <211> LENGTH: 1365 <212> TYPE: DNA <213>ORGANISM: Oryza sativa <400> SEQUENCE: 147 gaattcggca cgagcttacagattattgtt cgttcatcta actgcacgtc acgccctccg 60 cgtgtgcctc cccttcgcccgctcgctccg tcgttaagcc agatcgcccg ccgctccgtg 120 ccctagcgcc gccgccgccgtcgccgacac cggaatcgcc gggcagactg cgagcgcgga 180 gccaccagcg tggggcggcgggatgggcga ggaggccaag tacctcgaga cggcgcgggc 240 cgagcgctcc gtgtggctgatgaagtgccc cccggtcgtc tcgcacgcct ggcagggcgc 300 cgtgtcctcc tccgacgccgccggctccaa ccctaacccc gtcgtcgcca aggtcgtcct 360 ctcccttgac ctcctccgctccgaggagcc ctccctccag ttcaagatgg agatggctca 420 aactaacact ggcaatacaccaaagagtta ctccttgaat atgtccaagg attttgtacc 480 aatgtgtgtt ttctctgagtctaaccaagg gaaactttca tgtgaaggaa aagtcgagca 540 taaatttgac atgaagcctcacagtgataa tttggtgaac tatggaaagt tatgccgtga 600 aaggacacaa aagtccatgatcaaaactag aaaagtgcag gtaattgagg atcacagaat 660 gagcctgata ccgttgcctggcatggttgg tctcatacct tctggttcta aggagaagaa 720 gaagcaaaca ccaaccaaaccatctgatgc aaaaagaata cgtagggatc gcagggaact 780 ggaaaatatt atattcaagctttttgaaag acagcccaat tgggcactaa aggcgctggt 840 gcaagaaact gaccagccagagcaattcct gaaggagatt ctgaatgatc tgtgttttta 900 caacaaacga ggaccaaaccagggaacgca tgagctcaag cctgagtaca agaaatctac 960 aggggacact gatgcttcttgaatatgcta gttcatttaa ctgctttcgg ataaccaaca 1020 tactcaacct gttgactctgaagtcacgaa cattgagttg tgcagacatc tcatatttca 1080 gtaacattcg ttctgctgctaaatgctaag aggcataggc gaatagttgt aaggatagtt 1140 tgatacattt attagagaaaagaatagaaa agaaaaatct ggaatctatg ttcaggtaaa 1200 ataatgtgcc tggtgggagtcgcagcgtag agcatagtcc tattcataat tttcgttgtt 1260 tatatgctgt ttttatgtctcatttcatca tttgaaatgc aggatttatc tgattctttg 1320 aagatgtttt ttttattaccaacaatatta tgtcttcata ttctt 1365 <210> SEQ ID NO 148 <211> LENGTH: 259<212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 148 Met GlyGlu Glu Ala Lys Tyr Leu Glu Thr Ala Arg Ala Glu Arg Ser 1 5 10 15 ValTrp Leu Met Lys Cys Pro Pro Val Val Ser His Ala Trp Gln Gly 20 25 30 AlaVal Ser Ser Ser Asp Ala Ala Gly Ser Asn Pro Asn Pro Val Val 35 40 45 AlaLys Val Val Leu Ser Leu Asp Leu Leu Arg Ser Glu Glu Pro Ser 50 55 60 LeuGln Phe Lys Met Glu Met Ala Gln Thr Asn Thr Gly Asn Thr Pro 65 70 75 80Lys Ser Tyr Ser Leu Asn Met Ser Lys Asp Phe Val Pro Met Cys Val 85 90 95Phe Ser Glu Ser Asn Gln Gly Lys Leu Ser Cys Glu Gly Lys Val Glu 100 105110 His Lys Phe Asp Met Lys Pro His Ser Asp Asn Leu Val Asn Tyr Gly 115120 125 Lys Leu Cys Arg Glu Arg Thr Gln Lys Ser Met Ile Lys Thr Arg Lys130 135 140 Val Gln Val Ile Glu Asp His Arg Met Ser Leu Ile Pro Leu ProGly 145 150 155 160 Met Val Gly Leu Ile Pro Ser Gly Ser Lys Glu Lys LysLys Gln Thr 165 170 175 Pro Thr Lys Pro Ser Asp Ala Lys Arg Ile Arg ArgAsp Arg Arg Glu 180 185 190 Leu Glu Asn Ile Ile Phe Lys Leu Phe Glu ArgGln Pro Asn Trp Ala 195 200 205 Leu Lys Ala Leu Val Gln Glu Thr Asp GlnPro Glu Gln Phe Leu Lys 210 215 220 Glu Ile Leu Asn Asp Leu Cys Phe TyrAsn Lys Arg Gly Pro Asn Gln 225 230 235 240 Gly Thr His Glu Leu Lys ProGlu Tyr Lys Lys Ser Thr Gly Asp Thr 245 250 255 Asp Ala Ser <210> SEQ IDNO 149 <211> LENGTH: 1286 <212> TYPE: DNA <213> ORGANISM: Oryza sativa<400> SEQUENCE: 149 gaattcggca cgaggttcta acaatttctc cgccttctcgtcctcctcgc cgaaccaccc 60 ccatcgaccc acacaaccag cgaaccggaa ggatttcactcagctgagtt tcctgcctcc 120 atccccacgg cgatggcgga ggaggcgaag aatctggagacggcccgggc cgaccgctcc 180 gtgtggctca tgaagtgccc gaccgtcgtc tcccgcgcctggcaggaggc cgccaccgcc 240 gccgcctcct cctcctcctc ctcggacgcc gccgctggcgccaactccaa ctccaacgcg 300 aaccctaacc ccgtcgtcgc caaggtcatc gtctccctcgacccgctccg ctccgaagac 360 cagcagctcc agttcaagat ggagatggct caaacaggcaatggcaatac accaaagagt 420 tactccttga acatgttcaa ggattttgtg ccaatgtgtgtcttttctga atccaaccaa 480 gggaagcttt catgcgaagg gaaagtcggg cacaaatttgacatggaacc gcacagcgat 540 aatcttgtga actatgggaa gttatgtcgt gagaggactcaaaaatctat gattaaaaat 600 agaaaattga tggtacttgc gaatgacaat ggaatgagcatgaggccgtt gcctggcttg 660 gtgggtctga tgtcttctgg tccacaacag aaggagaagaagccactacc agtaaaacca 720 tcagacatga aaagaacgag aagggatcgc agggaactggaaaatatctt attcaagctt 780 tttgagaggc agccgaattg gtctcttaag aatctcatgcaagaaactga tcaaccagag 840 caattcttga aggagatatt gaatgatctg tgtttctacaacaaaagggg tccaaatcaa 900 gggacgcatg aacttaagcc tgaatacaag aaatctacagaggacgctga tgctactgct 960 acttagaagt cttatgcttc ggtctactag ataggagtcctcgtccagtg tggactccaa 1020 tccgttactt gctgcgttca aggaaatctt ggatacttcgtttatagttt gtttcttgag 1080 aaacatattt tgtacgcatc aagcgataca tcttttggtgttactggcgg ccatatctta 1140 gacccagatt tcggggactc gatatattcg tgtccatcagagtttagaaa caaggatgtt 1200 gaaatttgtg tgtataatgt attttagctc ttctaggcaaactaacatga tgattttcat 1260 gatcattaaa caccttgacc tactct 1286 <210> SEQID NO 150 <211> LENGTH: 277 <212> TYPE: PRT <213> ORGANISM: Oryza sativa<400> SEQUENCE: 150 Met Ala Glu Glu Ala Lys Asn Leu Glu Thr Ala Arg AlaAsp Arg Ser 1 5 10 15 Val Trp Leu Met Lys Cys Pro Thr Val Val Ser ArgAla Trp Gln Glu 20 25 30 Ala Ala Thr Ala Ala Ala Ser Ser Ser Ser Ser SerAsp Ala Ala Ala 35 40 45 Gly Ala Asn Ser Asn Ser Asn Ala Asn Pro Asn ProVal Val Ala Lys 50 55 60 Val Ile Val Ser Leu Asp Pro Leu Arg Ser Glu AspGln Gln Leu Gln 65 70 75 80 Phe Lys Met Glu Met Ala Gln Thr Gly Asn GlyAsn Thr Pro Lys Ser 85 90 95 Tyr Ser Leu Asn Met Phe Lys Asp Phe Val ProMet Cys Val Phe Ser 100 105 110 Glu Ser Asn Gln Gly Lys Leu Ser Cys GluGly Lys Val Gly His Lys 115 120 125 Phe Asp Met Glu Pro His Ser Asp AsnLeu Val Asn Tyr Gly Lys Leu 130 135 140 Cys Arg Glu Arg Thr Gln Lys SerMet Ile Lys Asn Arg Lys Leu Met 145 150 155 160 Val Leu Ala Asn Asp AsnGly Met Ser Met Arg Pro Leu Pro Gly Leu 165 170 175 Val Gly Leu Met SerSer Gly Pro Gln Gln Lys Glu Lys Lys Pro Leu 180 185 190 Pro Val Lys ProSer Asp Met Lys Arg Thr Arg Arg Asp Arg Arg Glu 195 200 205 Leu Glu AsnIle Leu Phe Lys Leu Phe Glu Arg Gln Pro Asn Trp Ser 210 215 220 Leu LysAsn Leu Met Gln Glu Thr Asp Gln Pro Glu Gln Phe Leu Lys 225 230 235 240Glu Ile Leu Asn Asp Leu Cys Phe Tyr Asn Lys Arg Gly Pro Asn Gln 245 250255 Gly Thr His Glu Leu Lys Pro Glu Tyr Lys Lys Ser Thr Glu Asp Ala 260265 270 Asp Ala Thr Ala Thr 275 <210> SEQ ID NO 151 <211> LENGTH: 642<212> TYPE: DNA <213> ORGANISM: Triticum aestivum <400> SEQUENCE: 151caaaaggaca cgtagggatc gccgggaact ggaaaacatt atcttcaagc ttttcgaaaa 60gcagcctaac tgggcactga aggctctagt gcaagagact gaccagccag agcaattcct 120caaggagata ttgaacgacc tctgtatgta caacaaacga gggccaaacc agggcacgca 180cgagctcaag cccgagtaca agaagtcgtc tgaggacgct gccggtgccc cgtgaagatg 240atctatctgt tttggcggct gttgtgtaac tgtgatacca ggaacctgtt ggcttgatgc 300gcggaactat gagttgtcta tgtgaagcct ccagtcatgc ttgggctgca tttgcttgct 360ttttattaag aggcattgtt gtaattgtat gggtataaga tgatacatat acaacacatg 420tcacagagag gaaatggatc tactgataaa ccacacgaaa gattgtttga ttttctgggt 480agatgtaaga atcatgtcga ggattcgacg tgtgcgatcc ttgttccttt tgagaactcc 540cgtacttcaa tacaatccag atgtttgggc taaaacatat atggaatcta gcctgattta 600aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 642 <210> SEQ ID NO 152<211> LENGTH: 77 <212> TYPE: PRT <213> ORGANISM: Triticum aestivum <400>SEQUENCE: 152 Lys Arg Thr Arg Arg Asp Arg Arg Glu Leu Glu Asn Ile IlePhe Lys 1 5 10 15 Leu Phe Glu Lys Gln Pro Asn Trp Ala Leu Lys Ala LeuVal Gln Glu 20 25 30 Thr Asp Gln Pro Glu Gln Phe Leu Lys Glu Ile Leu AsnAsp Leu Cys 35 40 45 Met Tyr Asn Lys Arg Gly Pro Asn Gln Gly Thr His GluLeu Lys Pro 50 55 60 Glu Tyr Lys Lys Ser Ser Glu Asp Ala Ala Gly Ala Pro65 70 75 <210> SEQ ID NO 153 <211> LENGTH: 2006 <212> TYPE: DNA <213>ORGANISM: Zea mays <400> SEQUENCE: 153 caaatatttg caagccacaa gccaagccaagcaaaaccct aagaaaccca ccctccctcc 60 agtctcctcg acccccgccg ccgccgccgccatcacaacc cagagcaccc cctccccacc 120 ccaccctgtc ccgatggctg cggacgccgccgctgcggcg atgtccgcga cttccctctg 180 cgacgatctc gagcccgcca ccgtgcgcacccgcatccgc gatgtcctgg ccgcaggtgc 240 cgcgcgtgcc ggggaccgcg tcgtggtcggtgggtgggtc cggacgggcc gggagcaggg 300 gaagggctct ttcgccttcc tggagctcagcgacgggtcc tgcgctgcca cgcttcaggc 360 catcgttgac gccgccgtgc acccgctagcgcgcctcacc gccaccggca cctccgtcct 420 cgtcgagggc gtgatcaagg agccgcccgaggggaccaag cagaacgtcg agctgaaggt 480 cagccgcgtg cttgaggtcg gcgaggtcgatgccgccgtc tacccgctgc ccaagggcaa 540 ggtcaaactc acgctcgaga agctcagggacgtcgttcat ctccgttcgc gcaccaacac 600 gattggagcg gttgctagga ttaggcaccagttagcctat gcatcacaca ggttctttga 660 tgagaatgga tttctctatg tacatacaccaataataact acgagtgact gtgaaggtgc 720 tggtgagatg tttcaagtga caactttatttagccaggct gagaaaactg agaaggaatt 780 gagagagaac ccaaaacctt ctgattctgaaattgaggca gctaaagttc ttgtcaagga 840 aaaaggtgat gtggtcgcac agcttaaagctgcaaaagcc agcaaacaag agatatcaac 900 tgctgtcgat gagcttaata gggcaaaggaaattgtttca aaactggagg agaggttcaa 960 gttaaagcct gggattccac gcaaggatgatggttcaata gcttttgaaa atgacttctt 1020 caaacgtcaa gcttttttga cagtgtctggacagcttcag gttgagacat atgcatgtgc 1080 tcttagtagc gtttatacct ttggaccaacattccgggca gagaactcac atacgtcacg 1140 acatttggca gagttttgga tggttgaaccggaaatcgca tttgcaaact tgcaggatga 1200 catgaactgt gcagaaaaat atgtacagtacctttgcaag tggctgcttg accattgcca 1260 ggaagatatg gagtttatgg tgaaaaattatgataagagt gcaattgaac gcctggagct 1320 tgtttcgtca actccttttg tgcggatttcatacacaaag gctgtggagc tcttgaaaaa 1380 tgttaccgac aagaagtttg acaacaaagttgaatgggga attgatttag cctctgagca 1440 tgaaaggtat ttaacagagg atatatttaagaagccagtt attgtctata actatccgaa 1500 aggaataaag gcattttata tgagactcaatgatgatgac aagacggtgg ctgcaatgga 1560 tgttcttgtg cctaaggttg gtgaattgattggtggaagc caaagggagg aacggttaga 1620 tgttctgaaa cagaggatac ttgatgctggcctgcctttg gaaccctatg aatggtactt 1680 ggacctccgt cgctttggat ctgtaaagcacagtggtttt gggttgggtt ttgaaaggat 1740 gattctattt gcaaccggaa tggagaacatcagagatgtt atacccttcc caaggtaccc 1800 aggaagggct gatctgtgat ctttggagacattggttaaa cagatgcgag ttgaattcaa 1860 cacgcgacta gtgattttga tatgttttcaccgttaagtt tctacttcct aatttgcaca 1920 ttttaattat aaattgtgga actggtacggttgtcaaaaa tcatgtaatt ttctggttgt 1980 attttattat atttgtatct ctgatc 2006<210> SEQ ID NO 154 <211> LENGTH: 561 <212> TYPE: PRT <213> ORGANISM:Zea mays <400> SEQUENCE: 154 Met Ala Ala Asp Ala Ala Ala Ala Ala Met SerAla Thr Ser Leu Cys 1 5 10 15 Asp Asp Leu Glu Pro Ala Thr Val Arg ThrArg Ile Arg Asp Val Leu 20 25 30 Ala Ala Gly Ala Ala Arg Ala Gly Asp ArgVal Val Val Gly Gly Trp 35 40 45 Val Arg Thr Gly Arg Glu Gln Gly Lys GlySer Phe Ala Phe Leu Glu 50 55 60 Leu Ser Asp Gly Ser Cys Ala Ala Thr LeuGln Ala Ile Val Asp Ala 65 70 75 80 Ala Val His Pro Leu Ala Arg Leu ThrAla Thr Gly Thr Ser Val Leu 85 90 95 Val Glu Gly Val Ile Lys Glu Pro ProGlu Gly Thr Lys Gln Asn Val 100 105 110 Glu Leu Lys Val Ser Arg Val LeuGlu Val Gly Glu Val Asp Ala Ala 115 120 125 Val Tyr Pro Leu Pro Lys GlyLys Val Lys Leu Thr Leu Glu Lys Leu 130 135 140 Arg Asp Val Val His LeuArg Ser Arg Thr Asn Thr Ile Gly Ala Val 145 150 155 160 Ala Arg Ile ArgHis Gln Leu Ala Tyr Ala Ser His Arg Phe Phe Asp 165 170 175 Glu Asn GlyPhe Leu Tyr Val His Thr Pro Ile Ile Thr Thr Ser Asp 180 185 190 Cys GluGly Ala Gly Glu Met Phe Gln Val Thr Thr Leu Phe Ser Gln 195 200 205 AlaGlu Lys Thr Glu Lys Glu Leu Arg Glu Asn Pro Lys Pro Ser Asp 210 215 220Ser Glu Ile Glu Ala Ala Lys Val Leu Val Lys Glu Lys Gly Asp Val 225 230235 240 Val Ala Gln Leu Lys Ala Ala Lys Ala Ser Lys Gln Glu Ile Ser Thr245 250 255 Ala Val Asp Glu Leu Asn Arg Ala Lys Glu Ile Val Ser Lys LeuGlu 260 265 270 Glu Arg Phe Lys Leu Lys Pro Gly Ile Pro Arg Lys Asp AspGly Ser 275 280 285 Ile Ala Phe Glu Asn Asp Phe Phe Lys Arg Gln Ala PheLeu Thr Val 290 295 300 Ser Gly Gln Leu Gln Val Glu Thr Tyr Ala Cys AlaLeu Ser Ser Val 305 310 315 320 Tyr Thr Phe Gly Pro Thr Phe Arg Ala GluAsn Ser His Thr Ser Arg 325 330 335 His Leu Ala Glu Phe Trp Met Val GluPro Glu Ile Ala Phe Ala Asn 340 345 350 Leu Gln Asp Asp Met Asn Cys AlaGlu Lys Tyr Val Gln Tyr Leu Cys 355 360 365 Lys Trp Leu Leu Asp His CysGln Glu Asp Met Glu Phe Met Val Lys 370 375 380 Asn Tyr Asp Lys Ser AlaIle Glu Arg Leu Glu Leu Val Ser Ser Thr 385 390 395 400 Pro Phe Val ArgIle Ser Tyr Thr Lys Ala Val Glu Leu Leu Lys Asn 405 410 415 Val Thr AspLys Lys Phe Asp Asn Lys Val Glu Trp Gly Ile Asp Leu 420 425 430 Ala SerGlu His Glu Arg Tyr Leu Thr Glu Asp Ile Phe Lys Lys Pro 435 440 445 ValIle Val Tyr Asn Tyr Pro Lys Gly Ile Lys Ala Phe Tyr Met Arg 450 455 460Leu Asn Asp Asp Asp Lys Thr Val Ala Ala Met Asp Val Leu Val Pro 465 470475 480 Lys Val Gly Glu Leu Ile Gly Gly Ser Gln Arg Glu Glu Arg Leu Asp485 490 495 Val Leu Lys Gln Arg Ile Leu Asp Ala Gly Leu Pro Leu Glu ProTyr 500 505 510 Glu Trp Tyr Leu Asp Leu Arg Arg Phe Gly Ser Val Lys HisSer Gly 515 520 525 Phe Gly Leu Gly Phe Glu Arg Met Ile Leu Phe Ala ThrGly Met Glu 530 535 540 Asn Ile Arg Asp Val Ile Pro Phe Pro Arg Tyr ProGly Arg Ala Asp 545 550 555 560 Leu <210> SEQ ID NO 155 <211> LENGTH:1105 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 155gcacgagctt acacaggatt tcttctgcaa accagcattt ctgacagtgt ctgggcaact 60gaatggtgaa acatacgcta cagctctatc agatgtttac acttttggtc caacatttag 120agctgaaaat tcaaacacct caagacattt ggctgaattt tggatgattg agcctgaact 180tgcctttgcg gatctaaatg atgacatggc atgtgcgagt tcatatctcc agtatgtagt 240gaagtatgtt ctagagaact gcaaagaaga tatggatttc tttaatacat ggattgaaaa 300agggatcatc gatagattaa atgacgtagt tgagaaaaac tttgttcact tgtcatattc 360tgatgctatt gagctacttg ttgggtccaa gaagaaattt gagttcccgg tcaaatgggg 420attggatctg caaagtgagc atgaaagata tatcacagaa gttgcttttg gtgggcgccc 480ggtgataatt agagattatc caaaggaaat caaagctttc tatatgcgag agaatgatga 540tggtaaaaca gttgctgcaa tggatctatt ggttcctcgg gttggtgaac tcattggtgg 600aagccaaagg gaagaacgtc ttgattacct tgaagctcgc ttggatgagt taaatcttaa 660caaagatagc tactggtggt acttagatct acgtcgatat ggatcagttc ctcatgctgg 720ttttggtctt ggatttgaaa gactagtaca gtttgcaact ggaatggaca acattagaga 780caccattcca tttccccggg ttccaggctc tgcagagttt tagaatccaa cattgctcac 840aaatactttt tgcaacattc tttcctgaga ttgggatttc aagaaaagta tacactctac 900tgtttcattg tacagagttt agttaagtta ttgcttacat cgagcaaaac aatattgttc 960caaattttgt tgtttggagt aacgtgcttg caaatttcat tgaacttccg atgtacatta 1020ttgttactgg acgtgttata atgatcacct gatggattga tataaaaaaa aaaaaaaaaa 1080aaaaaaaaaa aaaaaaaaaa aaaaa 1105 <210> SEQ ID NO 156 <211> LENGTH: 273<212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 156 His GluLeu Thr Gln Asp Phe Phe Cys Lys Pro Ala Phe Leu Thr Val 1 5 10 15 SerGly Gln Leu Asn Gly Glu Thr Tyr Ala Thr Ala Leu Ser Asp Val 20 25 30 TyrThr Phe Gly Pro Thr Phe Arg Ala Glu Asn Ser Asn Thr Ser Arg 35 40 45 HisLeu Ala Glu Phe Trp Met Ile Glu Pro Glu Leu Ala Phe Ala Asp 50 55 60 LeuAsn Asp Asp Met Ala Cys Ala Ser Ser Tyr Leu Gln Tyr Val Val 65 70 75 80Lys Tyr Val Leu Glu Asn Cys Lys Glu Asp Met Asp Phe Phe Asn Thr 85 90 95Trp Ile Glu Lys Gly Ile Ile Asp Arg Leu Asn Asp Val Val Glu Lys 100 105110 Asn Phe Val His Leu Ser Tyr Ser Asp Ala Ile Glu Leu Leu Val Gly 115120 125 Ser Lys Lys Lys Phe Glu Phe Pro Val Lys Trp Gly Leu Asp Leu Gln130 135 140 Ser Glu His Glu Arg Tyr Ile Thr Glu Val Ala Phe Gly Gly ArgPro 145 150 155 160 Val Ile Ile Arg Asp Tyr Pro Lys Glu Ile Lys Ala PheTyr Met Arg 165 170 175 Glu Asn Asp Asp Gly Lys Thr Val Ala Ala Met AspLeu Leu Val Pro 180 185 190 Arg Val Gly Glu Leu Ile Gly Gly Ser Gln ArgGlu Glu Arg Leu Asp 195 200 205 Tyr Leu Glu Ala Arg Leu Asp Glu Leu AsnLeu Asn Lys Asp Ser Tyr 210 215 220 Trp Trp Tyr Leu Asp Leu Arg Arg TyrGly Ser Val Pro His Ala Gly 225 230 235 240 Phe Gly Leu Gly Phe Glu ArgLeu Val Gln Phe Ala Thr Gly Met Asp 245 250 255 Asn Ile Arg Asp Thr IlePro Phe Pro Arg Val Pro Gly Ser Ala Glu 260 265 270 Phe <210> SEQ ID NO157 <211> LENGTH: 1849 <212> TYPE: DNA <213> ORGANISM: Glycine max <400>SEQUENCE: 157 gcacgagagc accaagaagc aaatggccgt tagcattggc gttggcgccacaaaagcgat 60 atcaatggca atagcagcga ggcacctcgg aacaaaacct tacgcagcagcagcaacaac 120 aacggcactt gctcttctct cactccacaa acctcttttc ctccctcattcctctccctt 180 ctcctctcgc cgcgctttct gcgccgcgac tcttcgcacc gccgataacagagtgcaaca 240 gttccgcagg aagcttaggg tttccgaaat caaagaaggc gacggcgccgacgtgttcgg 300 ccgcaacctc gtcgtgcagg gctgggtccg cacgctacgc attcagagtatcgtcacctt 360 cctcgagatt aacgacggtt cttgcctttc taacatgcaa tgtgtgttgaattcggaggc 420 tgaaggttac gatcaggtag aatctggctt ggttaccacg ggtgcttcagtgtgggtgca 480 aggagttgtg gtgaagagtc aaggatcgaa acagaaggtt gaattgaaggtcaacaaaat 540 agtactgatt ggcaagagcg atccctcctt tcccatccaa aagaaaagagccagcagaga 600 atttctaaga acaaaagcac atcttcgtgc gaggacaaat acttttggtgcagttgcgag 660 ggttaggaat gcattggcat atgctacaca taagttcttc caagaaaatgggtttgtatg 720 ggtctccagt cctatcatca cagcatcaga ttgcgaggga gcgggtgaacagttttgcgt 780 tactaccttg ataccaagtt ctcatgaaac tactgattct cctgttgatgctattccaaa 840 aacaaatgac ggattaattg actggtcaca agatttcttt ggaaaacccgcattcttgac 900 tgtttcaggt caactcaatg gtgaaactta tgctacttct ctctctgatgtgtatacatt 960 tggtcccaca ttccgagcag aaaattctaa cacttctagg cacttggctgaattttggat 1020 gattgaaccg gagcttgcat ttgctgatct aaatgatgac atggcttgtgcaactgctta 1080 tctccagttt gtaataagac atgttctcga taattgcaag gaagacatggagtttttcga 1140 tacatggatt aataaaggaa tcattgatcg cttgagtgat gtagcagataaagatgttgt 1200 gcaaataacc tacactgaag cgatagatct gctgtcagga gcaaataagaaatttgaatt 1260 cccggtgaaa tggggtagtg atcttcagag tgaacatgag cgttatataactgaagaggc 1320 attcagtgga tgcccagtga taattagaga ctatccaaag gatattaaagcattctatat 1380 gcgacagaat gatgatggac ggacagttgc agccatggac atgttggttccagggattgg 1440 tgaacttatt ggtggaagcc aaagagaaga aaggcttgag tatctagaagctcgtttaga 1500 tgatttaaag ctgaataagg atgcatattg gtggtatctt gatttgcgtcgatatggttc 1560 agtacctcat gcaggctttg gtttggggtt tgaaagactt gtccaatttgcaacaggaat 1620 ggacaatata agggacgtta ttccttttcc tcgaacacct ggctcggctgagttttagac 1680 cttgcagata taagcacttc acttcacatt catttgtacg attcaaactgtattttaggc 1740 aataaagtat cttgagtctt tccttttctt ctttttgttt tgaaaaagaaaactgcttgc 1800 ttataaatga agtatttgta ctttatgaat ttagtagaat ggaattctg1849 <210> SEQ ID NO 158 <211> LENGTH: 551 <212> TYPE: PRT <213>ORGANISM: Glycine max <400> SEQUENCE: 158 Met Ala Val Ser Ile Gly ValGly Ala Thr Lys Ala Ile Ser Met Ala 1 5 10 15 Ile Ala Ala Arg His LeuGly Thr Lys Pro Tyr Ala Ala Ala Ala Thr 20 25 30 Thr Thr Ala Leu Ala LeuLeu Ser Leu His Lys Pro Leu Phe Leu Pro 35 40 45 His Ser Ser Pro Phe SerSer Arg Arg Ala Phe Cys Ala Ala Thr Leu 50 55 60 Arg Thr Ala Asp Asn ArgVal Gln Gln Phe Arg Arg Lys Leu Arg Val 65 70 75 80 Ser Glu Ile Lys GluGly Asp Gly Ala Asp Val Phe Gly Arg Asn Leu 85 90 95 Val Val Gln Gly TrpVal Arg Thr Leu Arg Ile Gln Ser Ile Val Thr 100 105 110 Phe Leu Glu IleAsn Asp Gly Ser Cys Leu Ser Asn Met Gln Cys Val 115 120 125 Leu Asn SerGlu Ala Glu Gly Tyr Asp Gln Val Glu Ser Gly Leu Val 130 135 140 Thr ThrGly Ala Ser Val Trp Val Gln Gly Val Val Val Lys Ser Gln 145 150 155 160Gly Ser Lys Gln Lys Val Glu Leu Lys Val Asn Lys Ile Val Leu Ile 165 170175 Gly Lys Ser Asp Pro Ser Phe Pro Ile Gln Lys Lys Arg Ala Ser Arg 180185 190 Glu Phe Leu Arg Thr Lys Ala His Leu Arg Ala Arg Thr Asn Thr Phe195 200 205 Gly Ala Val Ala Arg Val Arg Asn Ala Leu Ala Tyr Ala Thr HisLys 210 215 220 Phe Phe Gln Glu Asn Gly Phe Val Trp Val Ser Ser Pro IleIle Thr 225 230 235 240 Ala Ser Asp Cys Glu Gly Ala Gly Glu Gln Phe CysVal Thr Thr Leu 245 250 255 Ile Pro Ser Ser His Glu Thr Thr Asp Ser ProVal Asp Ala Ile Pro 260 265 270 Lys Thr Asn Asp Gly Leu Ile Asp Trp SerGln Asp Phe Phe Gly Lys 275 280 285 Pro Ala Phe Leu Thr Val Ser Gly GlnLeu Asn Gly Glu Thr Tyr Ala 290 295 300 Thr Ser Leu Ser Asp Val Tyr ThrPhe Gly Pro Thr Phe Arg Ala Glu 305 310 315 320 Asn Ser Asn Thr Ser ArgHis Leu Ala Glu Phe Trp Met Ile Glu Pro 325 330 335 Glu Leu Ala Phe AlaAsp Leu Asn Asp Asp Met Ala Cys Ala Thr Ala 340 345 350 Tyr Leu Gln PheVal Ile Arg His Val Leu Asp Asn Cys Lys Glu Asp 355 360 365 Met Glu PhePhe Asp Thr Trp Ile Asn Lys Gly Ile Ile Asp Arg Leu 370 375 380 Ser AspVal Ala Asp Lys Asp Val Val Gln Ile Thr Tyr Thr Glu Ala 385 390 395 400Ile Asp Leu Leu Ser Gly Ala Asn Lys Lys Phe Glu Phe Pro Val Lys 405 410415 Trp Gly Ser Asp Leu Gln Ser Glu His Glu Arg Tyr Ile Thr Glu Glu 420425 430 Ala Phe Ser Gly Cys Pro Val Ile Ile Arg Asp Tyr Pro Lys Asp Ile435 440 445 Lys Ala Phe Tyr Met Arg Gln Asn Asp Asp Gly Arg Thr Val AlaAla 450 455 460 Met Asp Met Leu Val Pro Gly Ile Gly Glu Leu Ile Gly GlySer Gln 465 470 475 480 Arg Glu Glu Arg Leu Glu Tyr Leu Glu Ala Arg LeuAsp Asp Leu Lys 485 490 495 Leu Asn Lys Asp Ala Tyr Trp Trp Tyr Leu AspLeu Arg Arg Tyr Gly 500 505 510 Ser Val Pro His Ala Gly Phe Gly Leu GlyPhe Glu Arg Leu Val Gln 515 520 525 Phe Ala Thr Gly Met Asp Asn Ile ArgAsp Val Ile Pro Phe Pro Arg 530 535 540 Thr Pro Gly Ser Ala Glu Phe 545550 <210> SEQ ID NO 159 <211> LENGTH: 372 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 159 gcacgagtct taacaaagatagctactggt ggtatttgga tctgcggcga tatggatcag 60 ttcctcatgc tggttttggccttggatttg aacggcttgt acagtttgca accggaatag 120 acaacatcag agacgccattccatttccca gggttcctgg ttctgcggag ttttagcact 180 tgaagagtgc gtgtataatctctaaaagca ttcttgcctc ggacttgaag aaaaccttac 240 attgatttgt tcattgtaaagagtttaccc tttgcaaatt cgtcgaatgc tgggatttta 300 gtcgatgaat tctttgtgtacgtccccatt cagtaaacag acaattttgc cttggggact 360 aaaaaaaaaa aa 372 <210>SEQ ID NO 160 <211> LENGTH: 57 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <400> SEQUENCE: 160 Thr Ser Leu Asn Lys Asp Ser Tyr Trp Trp TyrLeu Asp Leu Arg Arg 1 5 10 15 Tyr Gly Ser Val Pro His Ala Gly Phe GlyLeu Gly Phe Glu Arg Leu 20 25 30 Val Gln Phe Ala Thr Gly Ile Asp Asn IleArg Asp Ala Ile Pro Phe 35 40 45 Pro Arg Val Pro Gly Ser Ala Glu Phe 5055 <210> SEQ ID NO 161 <211> LENGTH: 573 <212> TYPE: DNA <213> ORGANISM:Triticum aestivum <220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION:(300) <221> NAME/KEY: unsure <222> LOCATION: (434) <221> NAME/KEY:unsure <222> LOCATION: (476) <221> NAME/KEY: unsure <222> LOCATION:(484) <221> NAME/KEY: unsure <222> LOCATION: (522) <221> NAME/KEY:unsure <222> LOCATION: (525) <221> NAME/KEY: unsure <222> LOCATION:(527) <221> NAME/KEY: unsure <222> LOCATION: (540) <221> NAME/KEY:unsure <222> LOCATION: (548) <221> NAME/KEY: unsure <222> LOCATION:(551) <400> SEQUENCE: 161 ccgccgccta cccgctgccc aagaccaaga tcacgctcgagacgctcagg gacttcgtcc 60 acctccgcgc acgcaccaac acgataggcg cagttgctcggataaggcac cagcttgcct 120 acgcaaccca cagttttttc gatgaaaatg gatttttgtatattcacacc cccataataa 180 ccaccagcga ctgtgagggt gcaggtgaga tgttccaagtcactgcctta ttcagccaag 240 gctgaaaagg tggagaagga gcttaaggag aaccctgcaccatcagaagc tgacgttgan 300 gctgctaagc ttgttgttaa agagaaagga gatgcagttgctcaattgaa agcagcaaaa 360 gctagcaagc aagagataac tgctgctgtt tccgtgcttacaaaagctaa agagatgtgt 420 taagggtgga agangctcca aattgaactg gacttcactcaaggatgatg gaaatncgtt 480 tganaagact ctcaacgtca acttctgacg ttcaggcaactnagncnaaa ttagctgtgn 540 ccataagngt nacttggcaa atccggaaag atc 573 <210>SEQ ID NO 162 <211> LENGTH: 139 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (99)<400> SEQUENCE: 162 Ala Ala Tyr Pro Leu Pro Lys Thr Lys Ile Thr Leu GluThr Leu Arg 1 5 10 15 Asp Phe Val His Leu Arg Ala Arg Thr Asn Thr IleGly Ala Val Ala 20 25 30 Arg Ile Arg His Gln Leu Ala Tyr Ala Thr His SerPhe Phe Asp Glu 35 40 45 Asn Gly Phe Leu Tyr Ile His Thr Pro Ile Ile ThrThr Ser Asp Cys 50 55 60 Glu Gly Ala Gly Glu Met Phe Gln Val Thr Ala LeuPhe Ser Lys Ala 65 70 75 80 Glu Lys Val Glu Lys Glu Leu Lys Glu Asn ProAla Pro Ser Glu Ala 85 90 95 Asp Val Xaa Ala Ala Lys Leu Val Val Lys GluLys Gly Asp Ala Val 100 105 110 Ala Gln Leu Lys Ala Ala Lys Ala Ser LysGln Glu Ile Thr Ala Ala 115 120 125 Val Ser Val Leu Thr Lys Ala Lys GluMet Cys 130 135 <210> SEQ ID NO 163 <211> LENGTH: 1464 <212> TYPE: DNA<213> ORGANISM: Zea mays <400> SEQUENCE: 163 ccacgcgtcc gcaccagactgcagaagaaa tcaaggaata cagggaaaag aagatggata 60 gtccatggag gggtaggccaattgaagagt ctctgaaatt atttgaagac atgaggcgtg 120 ggttgattgc agagggtgcagcaactctcc gtatgaagca ggatatgcaa aatgagaaca 180 aaaatatgtc tgacttaatagcatatagaa taaaattcac cccccatcca catgctggcg 240 acaagtggtg tgtctatccgagctatgact acgctcattg catggtggat tctcttgaaa 300 acatcacaca ttcgctgtgcacacttgagt ttgacatacg tcgcccttca tactactggc 360 tacttgttgc cttgggcctttatcaaccat atgtgtggga atattcgagg ctaaacatat 420 caaatactgt gatgtctaaaagaaagttga atcgacttgt gacagagaag tgggtagatg 480 ggtgggatga tccccgcttgttgacactgg ctggtctccg gcgacgggga gtatcatcaa 540 ctgcaataaa ttcctttattcgtggaatcg ggataacgag aagtgacaat agcttaattc 600 gtgttgatcg tctagaatatcacatcaggg aggagcttaa caaaacagcc cctcgaacca 660 tggctgtttt gcgacctctaaaggtggtaa taactaactt ggaagaagga aaagtactag 720 accttgatgg caaaatgtggcctgatgctt ctgatactga tgcttcctcc cactataagg 780 ttccgttctc aagaactgtctacattgaga aaactgattt tcgcctaaag gactcaaaag 840 actactatgg gctagcccctggtaaatctg tcatgctaag gtatgcgttc cccataaaat 900 gcacagatgt tatctctggtgatagtcctg atgatattgt tgaaattcga gctgaatatg 960 atcctttgaa gacttctaaactgaagggtg ttcttcactg gattgctgag ccagcacctg 1020 gtgtagagcc attgaaggtggaagtaagat tattcgagaa attgttcatg tcagagaatc 1080 ctgctgaatt ggaagattggcttggcgatc ttaacccaca ctcgaaagag gtgataaagg 1140 atgcttatgc tgtaccatcacttgccactg cggttctggg tgacaagttc cagtttgagc 1200 ggcttggtta cttcgccgtggatactgact ccacacctga gaaactcgtg ttcaacagaa 1260 ctgttaccct ccgtgattcgttcgggaaag ctggacccaa gtgactgttc agtgtaattt 1320 agggagggcg ctggttttgatcggttgcag aggcgcacct gaactataca gtttgtgaag 1380 aaaatggtcg tctaatacagaacagtttaa aggccttact ctttataaaa tttagggttt 1440 tttaaaaaaa aaaaaaaaaaaaag 1464 <210> SEQ ID NO 164 <211> LENGTH: 433 <212> TYPE: PRT <213>ORGANISM: Zea mays <400> SEQUENCE: 164 Thr Arg Pro His Gln Thr Ala GluGlu Ile Lys Glu Tyr Arg Glu Lys 1 5 10 15 Lys Met Asp Ser Pro Trp ArgGly Arg Pro Ile Glu Glu Ser Leu Lys 20 25 30 Leu Phe Glu Asp Met Arg ArgGly Leu Ile Ala Glu Gly Ala Ala Thr 35 40 45 Leu Arg Met Lys Gln Asp MetGln Asn Glu Asn Lys Asn Met Ser Asp 50 55 60 Leu Ile Ala Tyr Arg Ile LysPhe Thr Pro His Pro His Ala Gly Asp 65 70 75 80 Lys Trp Cys Val Tyr ProSer Tyr Asp Tyr Ala His Cys Met Val Asp 85 90 95 Ser Leu Glu Asn Ile ThrHis Ser Leu Cys Thr Leu Glu Phe Asp Ile 100 105 110 Arg Arg Pro Ser TyrTyr Trp Leu Leu Val Ala Leu Gly Leu Tyr Gln 115 120 125 Pro Tyr Val TrpGlu Tyr Ser Arg Leu Asn Ile Ser Asn Thr Val Met 130 135 140 Ser Lys ArgLys Leu Asn Arg Leu Val Thr Glu Lys Trp Val Asp Gly 145 150 155 160 TrpAsp Asp Pro Arg Leu Leu Thr Leu Ala Gly Leu Arg Arg Arg Gly 165 170 175Val Ser Ser Thr Ala Ile Asn Ser Phe Ile Arg Gly Ile Gly Ile Thr 180 185190 Arg Ser Asp Asn Ser Leu Ile Arg Val Asp Arg Leu Glu Tyr His Ile 195200 205 Arg Glu Glu Leu Asn Lys Thr Ala Pro Arg Thr Met Ala Val Leu Arg210 215 220 Pro Leu Lys Val Val Ile Thr Asn Leu Glu Glu Gly Lys Val LeuAsp 225 230 235 240 Leu Asp Gly Lys Met Trp Pro Asp Ala Ser Asp Thr AspAla Ser Ser 245 250 255 His Tyr Lys Val Pro Phe Ser Arg Thr Val Tyr IleGlu Lys Thr Asp 260 265 270 Phe Arg Leu Lys Asp Ser Lys Asp Tyr Tyr GlyLeu Ala Pro Gly Lys 275 280 285 Ser Val Met Leu Arg Tyr Ala Phe Pro IleLys Cys Thr Asp Val Ile 290 295 300 Ser Gly Asp Ser Pro Asp Asp Ile ValGlu Ile Arg Ala Glu Tyr Asp 305 310 315 320 Pro Leu Lys Thr Ser Lys LeuLys Gly Val Leu His Trp Ile Ala Glu 325 330 335 Pro Ala Pro Gly Val GluPro Leu Lys Val Glu Val Arg Leu Phe Glu 340 345 350 Lys Leu Phe Met SerGlu Asn Pro Ala Glu Leu Glu Asp Trp Leu Gly 355 360 365 Asp Leu Asn ProHis Ser Lys Glu Val Ile Lys Asp Ala Tyr Ala Val 370 375 380 Pro Ser LeuAla Thr Ala Val Leu Gly Asp Lys Phe Gln Phe Glu Arg 385 390 395 400 LeuGly Tyr Phe Ala Val Asp Thr Asp Ser Thr Pro Glu Lys Leu Val 405 410 415Phe Asn Arg Thr Val Thr Leu Arg Asp Ser Phe Gly Lys Ala Gly Pro 420 425430 Lys <210> SEQ ID NO 165 <211> LENGTH: 1992 <212> TYPE: DNA <213>ORGANISM: Oryza sativa <400> SEQUENCE: 165 gcacgagatt gaatccatactctatatttc ctcagccaga ggaaaatttt aaggttcata 60 cagaaatatt ctatagtgatgggaacatat ggagagcgca taacagtaag gagattttag 120 agaaacacct taaggcaaccggtggaaaag tgatgacccg tttcccacca gaacctaatg 180 gatatcttca tattggtcatgccaaggcta tgtttattga ttttggactg gcgaaagaga 240 gaaatggtca ttgttaccttaggtttgatg acacaaatcc agaagccgaa aagaaagagt 300 atattgacca cattcaggaaatcgtacact ggatgggatg ggagccctac aaagttacat 360 atacaagtga ttatttccaggctttatatg agcatgcagt tgagttaata cgaaaagggc 420 tagcctatgt ggatcaccagaccgcagaag aaatcaagga atacagggaa aagaaaatga 480 atagtccatg gagggatagacccattgaag aatcactgaa actatttgaa gacatgagac 540 gtgggttgat tgctgaaggtgcagcaacac tccgaatgaa acaagatatg cagaatgata 600 acaagaatat gtctgatttaatagcatata gaataaaatt cactcctcat ccacatgctg 660 gtgataagtg gtgcatctatccaagctatg actatgctca ctgcatggtg gattctcttg 720 agaacattac acattcgctgtgcacgctcg agtttgacat acgtcgcccg tcatactact 780 ggctacttgt tgccttgggcctgtaccagc catatgtttg ggagtattcg aggctaaaca 840 tatcgaatac tgtgatgtctaaaagaaagt tgaatcgact tgtgacagaa aagtgggtag 900 atgggtggga tgaccctcgtttgttgacac tagcaggatt gcggcgacgt ggagtgtcat 960 caactgcaat taattcgtttatttgtggaa ttggaataac aagaagtgac aatagcttaa 1020 ttcgggttga ccgtcttgaatatcatatca gagaagagct taataaaaca gcttcccgtg 1080 ccatggttgt gttgaatcctctaaaggttg taataactaa cttggaggat gaaaaagtca 1140 tagaccttga tggaaaaatgtggcctgatg ctcctgcaga cgatgcttca tcctactaca 1200 aggttccttt ctcaagaatcgtttacatcg aaaaaactga ttttcgtcta aaggactcga 1260 aagattacta cgggctagctcctggtaaat ctgccctgct aagatatgca ttccccatta 1320 aatgtaccga ggttgtttatggtgacaatc cagatgacat cattgaaatt cgagctgaat 1380 atgacccttc aaagactactaaacctaagg gtgttctgca ctgggttgct cagccagcac 1440 ctggagttga accacttaaggtggaagtaa gattatttga taaattattc ctctctgaga 1500 atcctgctga actggaggattggctgggtg atcttaaccc gaactcaaaa gaggtgatca 1560 agggtgccta cgccgtgccatcgcttgcga ctgcggttct tggtgacaag ttccagttcg 1620 agcggctagg ctacttcgcagtggacacag actcgacacc tgagaacatt gtgttcaaca 1680 ggacggttac cctgcgtgattcgtatggga aagctgggcc aaagtgattg ccaacttact 1740 gaattttcag taagtttctttacctgggtg cactgaggtt acacagtggg aagaaaactg 1800 cagtgtgata cataaccatagtgaaaaaaa agggcccttt cttgtaaaat ggacttgttt 1860 gacttgttat atgtttccatcaaattgtgc atgattgagt atgttctcat gtcaatgagg 1920 tttggtgatt tgacttctagcatttataaa tggtgcaaac caacattata aacagaaaaa 1980 aaaaaaaaaa aa 1992<210> SEQ ID NO 166 <211> LENGTH: 574 <212> TYPE: PRT <213> ORGANISM:Oryza sativa <400> SEQUENCE: 166 Thr Arg Leu Asn Pro Tyr Ser Ile Phe ProGln Pro Glu Glu Asn Phe 1 5 10 15 Lys Val His Thr Glu Ile Phe Tyr SerAsp Gly Asn Ile Trp Arg Ala 20 25 30 His Asn Ser Lys Glu Ile Leu Glu LysHis Leu Lys Ala Thr Gly Gly 35 40 45 Lys Val Met Thr Arg Phe Pro Pro GluPro Asn Gly Tyr Leu His Ile 50 55 60 Gly His Ala Lys Ala Met Phe Ile AspPhe Gly Leu Ala Lys Glu Arg 65 70 75 80 Asn Gly His Cys Tyr Leu Arg PheAsp Asp Thr Asn Pro Glu Ala Glu 85 90 95 Lys Lys Glu Tyr Ile Asp His IleGln Glu Ile Val His Trp Met Gly 100 105 110 Trp Glu Pro Tyr Lys Val ThrTyr Thr Ser Asp Tyr Phe Gln Ala Leu 115 120 125 Tyr Glu His Ala Val GluLeu Ile Arg Lys Gly Leu Ala Tyr Val Asp 130 135 140 His Gln Thr Ala GluGlu Ile Lys Glu Tyr Arg Glu Lys Lys Met Asn 145 150 155 160 Ser Pro TrpArg Asp Arg Pro Ile Glu Glu Ser Leu Lys Leu Phe Glu 165 170 175 Asp MetArg Arg Gly Leu Ile Ala Glu Gly Ala Ala Thr Leu Arg Met 180 185 190 LysGln Asp Met Gln Asn Asp Asn Lys Asn Met Ser Asp Leu Ile Ala 195 200 205Tyr Arg Ile Lys Phe Thr Pro His Pro His Ala Gly Asp Lys Trp Cys 210 215220 Ile Tyr Pro Ser Tyr Asp Tyr Ala His Cys Met Val Asp Ser Leu Glu 225230 235 240 Asn Ile Thr His Ser Leu Cys Thr Leu Glu Phe Asp Ile Arg ArgPro 245 250 255 Ser Tyr Tyr Trp Leu Leu Val Ala Leu Gly Leu Tyr Gln ProTyr Val 260 265 270 Trp Glu Tyr Ser Arg Leu Asn Ile Ser Asn Thr Val MetSer Lys Arg 275 280 285 Lys Leu Asn Arg Leu Val Thr Glu Lys Trp Val AspGly Trp Asp Asp 290 295 300 Pro Arg Leu Leu Thr Leu Ala Gly Leu Arg ArgArg Gly Val Ser Ser 305 310 315 320 Thr Ala Ile Asn Ser Phe Ile Cys GlyIle Gly Ile Thr Arg Ser Asp 325 330 335 Asn Ser Leu Ile Arg Val Asp ArgLeu Glu Tyr His Ile Arg Glu Glu 340 345 350 Leu Asn Lys Thr Ala Ser ArgAla Met Val Val Leu Asn Pro Leu Lys 355 360 365 Val Val Ile Thr Asn LeuGlu Asp Glu Lys Val Ile Asp Leu Asp Gly 370 375 380 Lys Met Trp Pro AspAla Pro Ala Asp Asp Ala Ser Ser Tyr Tyr Lys 385 390 395 400 Val Pro PheSer Arg Ile Val Tyr Ile Glu Lys Thr Asp Phe Arg Leu 405 410 415 Lys AspSer Lys Asp Tyr Tyr Gly Leu Ala Pro Gly Lys Ser Ala Leu 420 425 430 LeuArg Tyr Ala Phe Pro Ile Lys Cys Thr Glu Val Val Tyr Gly Asp 435 440 445Asn Pro Asp Asp Ile Ile Glu Ile Arg Ala Glu Tyr Asp Pro Ser Lys 450 455460 Thr Thr Lys Pro Lys Gly Val Leu His Trp Val Ala Gln Pro Ala Pro 465470 475 480 Gly Val Glu Pro Leu Lys Val Glu Val Arg Leu Phe Asp Lys LeuPhe 485 490 495 Leu Ser Glu Asn Pro Ala Glu Leu Glu Asp Trp Leu Gly AspLeu Asn 500 505 510 Pro Asn Ser Lys Glu Val Ile Lys Gly Ala Tyr Ala ValPro Ser Leu 515 520 525 Ala Thr Ala Val Leu Gly Asp Lys Phe Gln Phe GluArg Leu Gly Tyr 530 535 540 Phe Ala Val Asp Thr Asp Ser Thr Pro Glu AsnIle Val Phe Asn Arg 545 550 555 560 Thr Val Thr Leu Arg Asp Ser Tyr GlyLys Ala Gly Pro Lys 565 570 <210> SEQ ID NO 167 <211> LENGTH: 1258 <212>TYPE: DNA <213> ORGANISM: Glycine max <400> SEQUENCE: 167 ccacgcgtccggctaaatcg tctagttaca gagaagtggg ttgatgggtg ggatgatcct 60 cgtttgatgacactagctgg tttgcggcgt agaggcatga ccccaactgc aatcaatgct 120 tttgtccgaggaattggaat aactagaagt gatggcactt tgatttctgt ggaacgcctt 180 gaatatcatgttagggaaga attgaacaaa acagcacctc gtgcaatggt tgtcctacat 240 ccactcaaggttgtcattac taatcttgaa gccaactcag caattgaggt tgatgcaaag 300 aaatggcctgatgctcaagc tgatgatgct tctgctttct acaagattcc attttccaat 360 gttgtatatattgaacattc ggacttccgg atgcaagatt caaaagatta ttatggcctt 420 gctcctgggaaatctgtgat actcagatat gcatttccta taaagtgcac tgaagttatt 480 ctagctgatgataatgagac tattcttgaa attcgagccg agtatgatcc ttcaaagaag 540 accaagcctaagggggttct ccattgggtt gctcaacctt ctcctggagt tgatccattg 600 aaggtggaagtcagattgtt tgagaggcta ttcctatcag agaatcccgc tgaacttgac 660 aactggcttggcgatttgaa cccaaattcc aaagtgataa ttcccgatgc atatggtgtg 720 tcttccatacagaatgcaaa agttggggac aatttccaat ttgaaagatt aggctatttt 780 gtggttgaccgggactcgac atcagaaaaa cttgttttta ataggactgt caccttaaag 840 ggcagctatagcaaaggtgg aaagtagggt ccctgaatat tgattagaag taacaaggtg 900 ggacaggcaggtctgcgggt aatacttgtt taaatagcta cttataatgc atagtttttt 960 ctttgagggggtggggggtt atgtacaaac atatttactg gccacttagc catgtacatt 1020 ttcttaagggattactcaat agttccccaa ttatttaatt attttgtttg gttggttttg 1080 tagtttgaagattttttctg tttagtccca atgtagtacc aaattgctgt tattttccta 1140 ccattttgtgtaaaattagt tctgaagaat ctttcccttt acatttaagt gaaaatcaag 1200 tctaatttttgttcatgtga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaag 1258 <210> SEQ IDNO 168 <211> LENGTH: 288 <212> TYPE: PRT <213> ORGANISM: Glycine max<400> SEQUENCE: 168 Pro Arg Val Arg Leu Asn Arg Leu Val Thr Glu Lys TrpVal Asp Gly 1 5 10 15 Trp Asp Asp Pro Arg Leu Met Thr Leu Ala Gly LeuArg Arg Arg Gly 20 25 30 Met Thr Pro Thr Ala Ile Asn Ala Phe Val Arg GlyIle Gly Ile Thr 35 40 45 Arg Ser Asp Gly Thr Leu Ile Ser Val Glu Arg LeuGlu Tyr His Val 50 55 60 Arg Glu Glu Leu Asn Lys Thr Ala Pro Arg Ala MetVal Val Leu His 65 70 75 80 Pro Leu Lys Val Val Ile Thr Asn Leu Glu AlaAsn Ser Ala Ile Glu 85 90 95 Val Asp Ala Lys Lys Trp Pro Asp Ala Gln AlaAsp Asp Ala Ser Ala 100 105 110 Phe Tyr Lys Ile Pro Phe Ser Asn Val ValTyr Ile Glu His Ser Asp 115 120 125 Phe Arg Met Gln Asp Ser Lys Asp TyrTyr Gly Leu Ala Pro Gly Lys 130 135 140 Ser Val Ile Leu Arg Tyr Ala PhePro Ile Lys Cys Thr Glu Val Ile 145 150 155 160 Leu Ala Asp Asp Asn GluThr Ile Leu Glu Ile Arg Ala Glu Tyr Asp 165 170 175 Pro Ser Lys Lys ThrLys Pro Lys Gly Val Leu His Trp Val Ala Gln 180 185 190 Pro Ser Pro GlyVal Asp Pro Leu Lys Val Glu Val Arg Leu Phe Glu 195 200 205 Arg Leu PheLeu Ser Glu Asn Pro Ala Glu Leu Asp Asn Trp Leu Gly 210 215 220 Asp LeuAsn Pro Asn Ser Lys Val Ile Ile Pro Asp Ala Tyr Gly Val 225 230 235 240Ser Ser Ile Gln Asn Ala Lys Val Gly Asp Asn Phe Gln Phe Glu Arg 245 250255 Leu Gly Tyr Phe Val Val Asp Arg Asp Ser Thr Ser Glu Lys Leu Val 260265 270 Phe Asn Arg Thr Val Thr Leu Lys Gly Ser Tyr Ser Lys Gly Gly Lys275 280 285 <210> SEQ ID NO 169 <211> LENGTH: 1324 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 169 gcacgagagg acatgcagaatgataacaaa aacatgtctg atttaatagc atatagaata 60 aaattcactc ctcatccgcatgctggcgac aaatggtgta tctatccaag ctatgactat 120 gctcattgca tggtggattcacttgaaaac attacacatt ctttgtgcac gctcgagttc 180 gacattcgtc gcccgtcatactactggcta cttgttgcct tgggcttgta ccagccatat 240 gtttgggagt attcgaggctaaacatatca catactatga tgtccaaaag aaagttgaat 300 cggcttgtga cagagaagtgggtagatggg tgggatgacc ctcgtttgtt gactttggca 360 ggactgaggc gacggggagtatcagcaact gcgatcaatt catttatccg tggaattggg 420 ataacgagaa gtgacaatagcttaatccgt gttgaccgtc ttgaatatca tatcagagaa 480 gaactgaata aaacagcttctcggaccatg gttgttttgc atcctctgaa ggttgtaata 540 actaatttgg aagatggaaaagtcatagac cttgatggaa aaaagtggcc tgatgctcct 600 gctgatgaag cttcgtcctactacaaggtt cctttctcaa aaaccgtcta cattgaaaaa 660 actgattttc gcgtgaaggactccaaagat tactatggat tggctcctgg taaatctgcc 720 ctgctgaggt atgcattcccgataaaatgc acagaggtta tttatggtga taatccagat 780 gatattgttg aaattcgagccgagtatgac ccttcaaaga cctctaaacc taagggtgtt 840 ctgcactggg ttgccgagccagcacctgga gttgagccgt taaaggttga aataagatta 900 tttgagaaat tattcctctcggagaatccc gccgaattga aagactggct gggtgatatt 960 aacccgcact caaaggaggtagtcaagggc gcctacgctg taccgtcact cgccaccgcg 1020 gttctgggcg acaagttccagtttgagcgg cttggctact ttgctgtgga cacggactcg 1080 acgcccgaga accttgtgttgaacaggact gtgaccctgc gcgactctta tgggaaggct 1140 ggacccaagt gactgcaaatttagtgaaga agctccgttt ttcttggtca gagagacgct 1200 aatgggagaa aacatagtcatgacatacaa catgaaatgg ccaagaggcc ctttttgttt 1260 ttgttttgca tactgggcttgtgatgacta ttatagtatg ttttgtttca gtcgatcaaa 1320 aaaa 1324 <210> SEQ IDNO 170 <211> LENGTH: 383 <212> TYPE: PRT <213> ORGANISM: Triticumaestivum <400> SEQUENCE: 170 Ala Arg Glu Asp Met Gln Asn Asp Asn Lys AsnMet Ser Asp Leu Ile 1 5 10 15 Ala Tyr Arg Ile Lys Phe Thr Pro His ProHis Ala Gly Asp Lys Trp 20 25 30 Cys Ile Tyr Pro Ser Tyr Asp Tyr Ala HisCys Met Val Asp Ser Leu 35 40 45 Glu Asn Ile Thr His Ser Leu Cys Thr LeuGlu Phe Asp Ile Arg Arg 50 55 60 Pro Ser Tyr Tyr Trp Leu Leu Val Ala LeuGly Leu Tyr Gln Pro Tyr 65 70 75 80 Val Trp Glu Tyr Ser Arg Leu Asn IleSer His Thr Met Met Ser Lys 85 90 95 Arg Lys Leu Asn Arg Leu Val Thr GluLys Trp Val Asp Gly Trp Asp 100 105 110 Asp Pro Arg Leu Leu Thr Leu AlaGly Leu Arg Arg Arg Gly Val Ser 115 120 125 Ala Thr Ala Ile Asn Ser PheIle Arg Gly Ile Gly Ile Thr Arg Ser 130 135 140 Asp Asn Ser Leu Ile ArgVal Asp Arg Leu Glu Tyr His Ile Arg Glu 145 150 155 160 Glu Leu Asn LysThr Ala Ser Arg Thr Met Val Val Leu His Pro Leu 165 170 175 Lys Val ValIle Thr Asn Leu Glu Asp Gly Lys Val Ile Asp Leu Asp 180 185 190 Gly LysLys Trp Pro Asp Ala Pro Ala Asp Glu Ala Ser Ser Tyr Tyr 195 200 205 LysVal Pro Phe Ser Lys Thr Val Tyr Ile Glu Lys Thr Asp Phe Arg 210 215 220Val Lys Asp Ser Lys Asp Tyr Tyr Gly Leu Ala Pro Gly Lys Ser Ala 225 230235 240 Leu Leu Arg Tyr Ala Phe Pro Ile Lys Cys Thr Glu Val Ile Tyr Gly245 250 255 Asp Asn Pro Asp Asp Ile Val Glu Ile Arg Ala Glu Tyr Asp ProSer 260 265 270 Lys Thr Ser Lys Pro Lys Gly Val Leu His Trp Val Ala GluPro Ala 275 280 285 Pro Gly Val Glu Pro Leu Lys Val Glu Ile Arg Leu PheGlu Lys Leu 290 295 300 Phe Leu Ser Glu Asn Pro Ala Glu Leu Lys Asp TrpLeu Gly Asp Ile 305 310 315 320 Asn Pro His Ser Lys Glu Val Val Lys GlyAla Tyr Ala Val Pro Ser 325 330 335 Leu Ala Thr Ala Val Leu Gly Asp LysPhe Gln Phe Glu Arg Leu Gly 340 345 350 Tyr Phe Ala Val Asp Thr Asp SerThr Pro Glu Asn Leu Val Leu Asn 355 360 365 Arg Thr Val Thr Leu Arg AspSer Tyr Gly Lys Ala Gly Pro Lys 370 375 380 <210> SEQ ID NO 171 <211>LENGTH: 1398 <212> TYPE: DNA <213> ORGANISM: Oryza sativa <400>SEQUENCE: 171 gcacgagctt acatcatccc ccgcatacca cttactccgc ttgcatctgccacagaagga 60 attcaagctg ttctggattg gttatcccct caaactccga acttctcgccttctgggatg 120 ccactcatca taagtcagtt ttatgaaaat ttactgagaa gcacactttctattgcgagc 180 tatgaggcct gttccttcat gggatgcact agctcgatct taggaacattgacttccttc 240 attgagctct ccccttacag accctgtggg acttatcttt tcttgacaagcagtgaacag 300 ctggccgttc tgacaaattc agatgctgta ttgcagttgc tattttactgccttcagttg 360 gatccccagc aacaattgcg tgatgctgct gagagaagtt taagtgctcattggcaatat 420 gaaccaatta agcaaagcat gatgcaagag atagtttgcg tggattacttgggagtggtt 480 tcatcaacac ttcctggtag gcagatgagt agcacaatag ttggaggccttgaactgagc 540 aaggaagcta tgttgagcct ttctgcagct ggacaatggg agaagcaaagggagacaaac 600 caggcaaaaa tagatggtgc aagctgcacc aaaattaggg aagccctcaagtctctgaac 660 gagtacaaaa ggacatgtga ctccatgaag ttggaactag ctggactctgggatgagata 720 gttgaaatgc tgcgaaggcg tgagctgcca gatggtttcg aaagccgacaagattgggtg 780 aatctaggaa ccttgtaccg ccggttggtt gagcccttag acattgcaaactactatagg 840 cattccaaga acgaggacac tggctcctac ctctccaagg gaaggccaagacgttacaag 900 tacacccagg agtggcacga gcagtcgcag cgcatctcct tcggttctagccttgagtct 960 tgcttctggg caatggctga ggaactccag gctgagattg ctaacggcaaaacattcgag 1020 gatgtgagag atagagtagt taagcttgag agtgatgctc acggatggtccatgtctgga 1080 agtttaggga aggatatatt cctgagccgt tcttcttttg tgatttggtggaagacgctc 1140 ccggagaacc acagatctgc atcatgcatc gcgaaacttg tgccctggtaaggaagcttt 1200 acattatatg gatagaatta tttgccttca caatgatgag gcaggcattcgtggcatgtt 1260 gcaactactg ccactgtaac tgtataccat atgcatgaac tttgtaaatatgctacttcg 1320 cgtttttgta tcctacacaa tacaatcagt tcaagttgaa ccatttgtttcttgcatcaa 1380 aaaaaaaaaa aaaaaaaa 1398 <210> SEQ ID NO 172 <211>LENGTH: 396 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE:172 Ala Arg Ala Tyr Ile Ile Pro Arg Ile Pro Leu Thr Pro Leu Ala Ser 1 510 15 Ala Thr Glu Gly Ile Gln Ala Val Leu Asp Trp Leu Ser Pro Gln Thr 2025 30 Pro Asn Phe Ser Pro Ser Gly Met Pro Leu Ile Ile Ser Gln Phe Tyr 3540 45 Glu Asn Leu Leu Arg Ser Thr Leu Ser Ile Ala Ser Tyr Glu Ala Cys 5055 60 Ser Phe Met Gly Cys Thr Ser Ser Ile Leu Gly Thr Leu Thr Ser Phe 6570 75 80 Ile Glu Leu Ser Pro Tyr Arg Pro Cys Gly Thr Tyr Leu Phe Leu Thr85 90 95 Ser Ser Glu Gln Leu Ala Val Leu Thr Asn Ser Asp Ala Val Leu Gln100 105 110 Leu Leu Phe Tyr Cys Leu Gln Leu Asp Pro Gln Gln Gln Leu ArgAsp 115 120 125 Ala Ala Glu Arg Ser Leu Ser Ala His Trp Gln Tyr Glu ProIle Lys 130 135 140 Gln Ser Met Met Gln Glu Ile Val Cys Val Asp Tyr LeuGly Val Val 145 150 155 160 Ser Ser Thr Leu Pro Gly Arg Gln Met Ser SerThr Ile Val Gly Gly 165 170 175 Leu Glu Leu Ser Lys Glu Ala Met Leu SerLeu Ser Ala Ala Gly Gln 180 185 190 Trp Glu Lys Gln Arg Glu Thr Asn GlnAla Lys Ile Asp Gly Ala Ser 195 200 205 Cys Thr Lys Ile Arg Glu Ala LeuLys Ser Leu Asn Glu Tyr Lys Arg 210 215 220 Thr Cys Asp Ser Met Lys LeuGlu Leu Ala Gly Leu Trp Asp Glu Ile 225 230 235 240 Val Glu Met Leu ArgArg Arg Glu Leu Pro Asp Gly Phe Glu Ser Arg 245 250 255 Gln Asp Trp ValAsn Leu Gly Thr Leu Tyr Arg Arg Leu Val Glu Pro 260 265 270 Leu Asp IleAla Asn Tyr Tyr Arg His Ser Lys Asn Glu Asp Thr Gly 275 280 285 Ser TyrLeu Ser Lys Gly Arg Pro Arg Arg Tyr Lys Tyr Thr Gln Glu 290 295 300 TrpHis Glu Gln Ser Gln Arg Ile Ser Phe Gly Ser Ser Leu Glu Ser 305 310 315320 Cys Phe Trp Ala Met Ala Glu Glu Leu Gln Ala Glu Ile Ala Asn Gly 325330 335 Lys Thr Phe Glu Asp Val Arg Asp Arg Val Val Lys Leu Glu Ser Asp340 345 350 Ala His Gly Trp Ser Met Ser Gly Ser Leu Gly Lys Asp Ile PheLeu 355 360 365 Ser Arg Ser Ser Phe Val Ile Trp Trp Lys Thr Leu Pro GluAsn His 370 375 380 Arg Ser Ala Ser Cys Ile Ala Lys Leu Val Pro Trp 385390 395 <210> SEQ ID NO 173 <211> LENGTH: 2248 <212> TYPE: DNA <213>ORGANISM: Glycine max <400> SEQUENCE: 173 cccccgggct gcaggaattcggcacgaggt gaaactacaa ggtcatggtc aattcagtag 60 taattcaata tgtcctagttagttccccct ttatataata ccaagatgca gaagttctga 120 aaattaacca aactcaaaaatttccaaaac tttatagaat tagagggaca acatggctgg 180 agggttgctt ggagataaccttggattgaa ggaagatgtg atcaagaggg tgtgtggttt 240 ggcctccaaa gctcacaatcacaagtcaac ggacaagctc tacttttacg ataaggttcg 300 gacttcttcg gggacataccatgttttcag cttctcggga tcttgggatc ccgctgaatg 360 gttttttagc aaaccctttggcggatccaa gatagatcct acccaatttc cttcactcag 420 aagtattggt aacgatgaacctgctttggt gaacgaaggc ttcgcaaaga gattcgatcg 480 cgtattgaaa actagctttaaagccgaggt gaataaggct attggagatg ggaagcaagt 540 agtgtttacg gggcactcctctggtgctgc aatagccatt cttgctacct tttgggcatt 600 ggaagagtat cttaaccctacaaaaatcca aaaacccacg ccaccctttt gtgtcacttt 660 tgggtctccc ttaattggcaatcatatatt ctctcacgct tcaaggagag aaaattggtc 720 tcgctatttc atacactttgttttgagata tgacatagtg ccaaggattt tgctttctcg 780 cttggcttct attaagcaaacttttggttc tgttctccaa ttcttgaatc ccaattccaa 840 aacttccacc caggatccaacaagggctag tttaatttct gaattttaca aaactgtgat 900 gacaaacgca gcgagtgttacaagccatgc tgcgtgtatt ctcatgggaa gcacaagttt 960 gttacttggg acagtggcgaattttgttga gttgagccct tataggccct ttggaacatt 1020 tattttctgc aacggaaatggacaattgat tgtggtgaaa aactcagatg ctgttttgca 1080 actcttgttc cacactgctcagatgagcga tttggcagaa cttccagaag ttgccaatgt 1140 tagcatattg caacaccaggcctatgaggc tgaattggat gatagcttgg gaatgcagaa 1200 tgtagtgtac ttggagcaactagagcaact tccgttgtct gctgatggtt ctaatagcga 1260 tgttgcaaca atcagtgcagccttggatgg ccttggactg agcacaagag caaggctgtg 1320 tctacgagca gctggtgagttggaaaagca gaagctaaaa aatgaggaga aaatcaagaa 1380 ggagattcag gagaaagccgtgccaagcat gacgaagctt caaaattaca aaacaacatg 1440 tgagatgcac aaggggaagggctattacga tgccttcaag gtgcaaaatg aggaaaacga 1500 cttccaagca aatgtgaagaggcttgtgtt agcaggggta tgggatgaag tgattgagat 1560 gctaaagagg tatgaactcccagacgagtt tgaagggaat tcaaaatgga ttgaacatgg 1620 aactgaattt cgtcgccttgtggagcctct agacattgca aactatcacc gccacttgaa 1680 gaatgaggac acaggaccttacatgataag ggccaggcca aaacggtata ggtacactca 1740 aagatggttg gagcatgctaaaagggtgcc aaaacctgct cctatcactg aatcaacctt 1800 ttgggctgaa gtagaagagctttatagctg gattaacagc aagaggcatc ttgatgatga 1860 agtaaagcaa agggttgtgcagcttcagaa agaccttaaa aagtggactg atgatgagaa 1920 ggtactaacc aaggatacgttcttgaagga tcctaacttt attaggtggt gggatattct 1980 acctcaggaa ctcagggtcacttccttcta gtcttgttac tgctgtggaa ggatagtgaa 2040 ttggtgtttt tttcccctctaaattaatgg agaagctata gaaatttcgg catgcaggtg 2100 cagtgttggc tacttcggtgttgtaacttt gtattcccaa tgtttatgat tcccaagttt 2160 tgtatgactt gtgagtgagaaccccaaatc attcagactg ttgaactaaa taaatttgtt 2220 taatctaggg tctgtattagtgggttag 2248 <210> SEQ ID NO 174 <211> LENGTH: 612 <212> TYPE: PRT<213> ORGANISM: Glycine max <400> SEQUENCE: 174 Met Ala Gly Gly Leu LeuGly Asp Asn Leu Gly Leu Lys Glu Asp Val 1 5 10 15 Ile Lys Arg Val CysGly Leu Ala Ser Lys Ala His Asn His Lys Ser 20 25 30 Thr Asp Lys Leu TyrPhe Tyr Asp Lys Val Arg Thr Ser Ser Gly Thr 35 40 45 Tyr His Val Phe SerPhe Ser Gly Ser Trp Asp Pro Ala Glu Trp Phe 50 55 60 Phe Ser Lys Pro PheGly Gly Ser Lys Ile Asp Pro Thr Gln Phe Pro 65 70 75 80 Ser Leu Arg SerIle Gly Asn Asp Glu Pro Ala Leu Val Asn Glu Gly 85 90 95 Phe Ala Lys ArgPhe Asp Arg Val Leu Lys Thr Ser Phe Lys Ala Glu 100 105 110 Val Asn LysAla Ile Gly Asp Gly Lys Gln Val Val Phe Thr Gly His 115 120 125 Ser SerGly Ala Ala Ile Ala Ile Leu Ala Thr Phe Trp Ala Leu Glu 130 135 140 GluTyr Leu Asn Pro Thr Lys Ile Gln Lys Pro Thr Pro Pro Phe Cys 145 150 155160 Val Thr Phe Gly Ser Pro Leu Ile Gly Asn His Ile Phe Ser His Ala 165170 175 Ser Arg Arg Glu Asn Trp Ser Arg Tyr Phe Ile His Phe Val Leu Arg180 185 190 Tyr Asp Ile Val Pro Arg Ile Leu Leu Ser Arg Leu Ala Ser IleLys 195 200 205 Gln Thr Phe Gly Ser Val Leu Gln Phe Leu Asn Pro Asn SerLys Thr 210 215 220 Ser Thr Gln Asp Pro Thr Arg Ala Ser Leu Ile Ser GluPhe Tyr Lys 225 230 235 240 Thr Val Met Thr Asn Ala Ala Ser Val Thr SerHis Ala Ala Cys Ile 245 250 255 Leu Met Gly Ser Thr Ser Leu Leu Leu GlyThr Val Ala Asn Phe Val 260 265 270 Glu Leu Ser Pro Tyr Arg Pro Phe GlyThr Phe Ile Phe Cys Asn Gly 275 280 285 Asn Gly Gln Leu Ile Val Val LysAsn Ser Asp Ala Val Leu Gln Leu 290 295 300 Leu Phe His Thr Ala Gln MetSer Asp Leu Ala Glu Leu Pro Glu Val 305 310 315 320 Ala Asn Val Ser IleLeu Gln His Gln Ala Tyr Glu Ala Glu Leu Asp 325 330 335 Asp Ser Leu GlyMet Gln Asn Val Val Tyr Leu Glu Gln Leu Glu Gln 340 345 350 Leu Pro LeuSer Ala Asp Gly Ser Asn Ser Asp Val Ala Thr Ile Ser 355 360 365 Ala AlaLeu Asp Gly Leu Gly Leu Ser Thr Arg Ala Arg Leu Cys Leu 370 375 380 ArgAla Ala Gly Glu Leu Glu Lys Gln Lys Leu Lys Asn Glu Glu Lys 385 390 395400 Ile Lys Lys Glu Ile Gln Glu Lys Ala Val Pro Ser Met Thr Lys Leu 405410 415 Gln Asn Tyr Lys Thr Thr Cys Glu Met His Lys Gly Lys Gly Tyr Tyr420 425 430 Asp Ala Phe Lys Val Gln Asn Glu Glu Asn Asp Phe Gln Ala AsnVal 435 440 445 Lys Arg Leu Val Leu Ala Gly Val Trp Asp Glu Val Ile GluMet Leu 450 455 460 Lys Arg Tyr Glu Leu Pro Asp Glu Phe Glu Gly Asn SerLys Trp Ile 465 470 475 480 Glu His Gly Thr Glu Phe Arg Arg Leu Val GluPro Leu Asp Ile Ala 485 490 495 Asn Tyr His Arg His Leu Lys Asn Glu AspThr Gly Pro Tyr Met Ile 500 505 510 Arg Ala Arg Pro Lys Arg Tyr Arg TyrThr Gln Arg Trp Leu Glu His 515 520 525 Ala Lys Arg Val Pro Lys Pro AlaPro Ile Thr Glu Ser Thr Phe Trp 530 535 540 Ala Glu Val Glu Glu Leu TyrSer Trp Ile Asn Ser Lys Arg His Leu 545 550 555 560 Asp Asp Glu Val LysGln Arg Val Val Gln Leu Gln Lys Asp Leu Lys 565 570 575 Lys Trp Thr AspAsp Glu Lys Val Leu Thr Lys Asp Thr Phe Leu Lys 580 585 590 Asp Pro AsnPhe Ile Arg Trp Trp Asp Ile Leu Pro Gln Glu Leu Arg 595 600 605 Val ThrSer Phe 610 <210> SEQ ID NO 175 <211> LENGTH: 1532 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (286) <400> SEQUENCE: 175 gcacgaggtg acctttgggg cacctcttgttggcgataat gttttcaacc atgctgttag 60 aagggagggc tggtcgcagt gtattttgcatttcatcatg ccactggaca tcatcccgcg 120 aatactgctg gcccccctcg catcctctagagaacaaatt cagtccgttc tggattggtt 180 atctcctcac agtccaaact tctcacctgttgggaattcg cttgttattc cagagtttta 240 tgaaactttg ttgagaagca ccttatctattgccagctac gaggcntgtt ctttcatggg 300 atgcactagc tcaatcctag gaacgctgacttactttatt gagctatccc cgtacagacc 360 ctgtgggact taccttttct tgacaagtaccgatcaactg attgttctca caaactcaga 420 tgctgtatta cagttgctat tttactgccttcaattggat ccccaacaac aattgcttga 480 tgctgccgcc agaagtttaa gtgctcactgggaatatgaa tcgattaagc aaagtgtgat 540 gcaagagata ggttgtgtgg attacctaagagcaatttca tcatcccttc ttggcacaca 600 gatgaatggg acagcaattg gtggccttgaactgagtaag gaagctatgc tgagccttgc 660 tgctgctgca caatgggaga aacaaagagagataaaccaa gcaaagatag acgcaaactg 720 cagtaaaatt caggaagccc tcaagtccctgaacgagtac aaaagaacat gcgagctgca 780 tgaagtgagc tactatgatt ccttcaagcttcagcgggaa gtgcatgact tcaattcaaa 840 tgtacggagg ttggaactag ccggcttttgggacgagata attgagatgt tgcgaaggcg 900 tgagcttcca gacgcgttcg aagggcgagaagagtgggtg aacctgggga cctcgtaccg 960 ccggctggtt gagcccctgg acattgcaaactactacagg cattccaaga atgaggacac 1020 cggctcctac ctctccaagg gcaggccgagacgttacaag tacacccaga aatggcgcga 1080 gcagtcgcat cgcatcccct tgggttccagcctcgagtcg tgcttctggg caatgtccga 1140 ggagctgcag gctgagatga tcaacggcaaatcattcgag gatctgaaag acagggtggg 1200 taaactagag agcgatgcac ttggatggttcacctcggga aatcttggca gggatgtgtt 1260 tctgagcagc tcatcctttg tgatatggtggaagacgctc ccggagcagc acaggtccgc 1320 gtcttgcatc gcgagacttg tgccttcgtaagagaatttc tccgtcgctt gatgaggtag 1380 ccatttccga tctaatcgga actgctggcactggatattg taagcctgaa ctctcatgta 1440 aagtcatttg tgtgtagtat tctccaggatgaacagacta tttaaaatgc taaacagttg 1500 ttgtccaata aaaaaaaaaa aaaaaaaaaaaa 1532 <210> SEQ ID NO 176 <211> LENGTH: 449 <212> TYPE: PRT <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 176 His Glu Val Thr Phe GlyAla Pro Leu Val Gly Asp Asn Val Phe Asn 1 5 10 15 His Ala Val Arg ArgGlu Gly Trp Ser Gln Cys Ile Leu His Phe Ile 20 25 30 Met Pro Leu Asp IleIle Pro Arg Ile Leu Leu Ala Pro Leu Ala Ser 35 40 45 Ser Arg Glu Gln IleGln Ser Val Leu Asp Trp Leu Ser Pro His Ser 50 55 60 Pro Asn Phe Ser ProVal Gly Asn Ser Leu Val Ile Pro Glu Phe Tyr 65 70 75 80 Glu Thr Leu LeuArg Ser Thr Leu Ser Ile Ala Ser Tyr Glu Ala Cys 85 90 95 Ser Phe Met GlyCys Thr Ser Ser Ile Leu Gly Thr Leu Thr Tyr Phe 100 105 110 Ile Glu LeuSer Pro Tyr Arg Pro Cys Gly Thr Tyr Leu Phe Leu Thr 115 120 125 Ser ThrAsp Gln Leu Ile Val Leu Thr Asn Ser Asp Ala Val Leu Gln 130 135 140 LeuLeu Phe Tyr Cys Leu Gln Leu Asp Pro Gln Gln Gln Leu Leu Asp 145 150 155160 Ala Ala Ala Arg Ser Leu Ser Ala His Trp Glu Tyr Glu Ser Ile Lys 165170 175 Gln Ser Val Met Gln Glu Ile Gly Cys Val Asp Tyr Leu Arg Ala Ile180 185 190 Ser Ser Ser Leu Leu Gly Thr Gln Met Asn Gly Thr Ala Ile GlyGly 195 200 205 Leu Glu Leu Ser Lys Glu Ala Met Leu Ser Leu Ala Ala AlaAla Gln 210 215 220 Trp Glu Lys Gln Arg Glu Ile Asn Gln Ala Lys Ile AspAla Asn Cys 225 230 235 240 Ser Lys Ile Gln Glu Ala Leu Lys Ser Leu AsnGlu Tyr Lys Arg Thr 245 250 255 Cys Glu Leu His Glu Val Ser Tyr Tyr AspSer Phe Lys Leu Gln Arg 260 265 270 Glu Val His Asp Phe Asn Ser Asn ValArg Arg Leu Glu Leu Ala Gly 275 280 285 Phe Trp Asp Glu Ile Ile Glu MetLeu Arg Arg Arg Glu Leu Pro Asp 290 295 300 Ala Phe Glu Gly Arg Glu GluTrp Val Asn Leu Gly Thr Ser Tyr Arg 305 310 315 320 Arg Leu Val Glu ProLeu Asp Ile Ala Asn Tyr Tyr Arg His Ser Lys 325 330 335 Asn Glu Asp ThrGly Ser Tyr Leu Ser Lys Gly Arg Pro Arg Arg Tyr 340 345 350 Lys Tyr ThrGln Lys Trp Arg Glu Gln Ser His Arg Ile Pro Leu Gly 355 360 365 Ser SerLeu Glu Ser Cys Phe Trp Ala Met Ser Glu Glu Leu Gln Ala 370 375 380 GluMet Ile Asn Gly Lys Ser Phe Glu Asp Leu Lys Asp Arg Val Gly 385 390 395400 Lys Leu Glu Ser Asp Ala Leu Gly Trp Phe Thr Ser Gly Asn Leu Gly 405410 415 Arg Asp Val Phe Leu Ser Ser Ser Ser Phe Val Ile Trp Trp Lys Thr420 425 430 Leu Pro Glu Gln His Arg Ser Ala Ser Cys Ile Ala Arg Leu ValPro 435 440 445 Ser <210> SEQ ID NO 177 <211> LENGTH: 757 <212> TYPE:DNA <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: unsure <222>LOCATION: (542) <221> NAME/KEY: unsure <222> LOCATION: (603) <221>NAME/KEY: unsure <222> LOCATION: (678) <221> NAME/KEY: unsure <222>LOCATION: (725) <221> NAME/KEY: unsure <222> LOCATION: (744) <400>SEQUENCE: 177 ggatctaacc aaaccccgcg ccctcctcac cgtccggcga gctacggactcagcagatca 60 ccgtcgctcg agttgtacct gaaggcgcgc ccgtggaacc ggccgcgagataagggcggc 120 gggaaggcgg gcgacgatgc cggtggcagc gtcggccatc tacttcctcaaccttcgcgg 180 ggacgtcctc atcaaccgcc tctaccgtga tgatgttggg ggaaatatggttgatgcgtt 240 cagaatgcat atcatgcaaa caaaagaact tggcacatgc cctgttcgtcaaataggagg 300 ctgctccttc ctttatatga ggatcagtaa tgtttacatt gtgatcgtagttagcagcaa 360 tgctaatgtt gcatgtgctt tcaaatttgt tgtcgaggcg gtggctctcttcaagtccta 420 cttcggtgga gcttttgatg aagacgctat caggaataac tttgttttgatatatgaact 480 tcttgatgag atcatggatt ttggttatcc tcaaaacctg tcacctgaaattttgaagtt 540 gnatataact caagaaggtg ttcggtcacc attttcctca aaagcccttaaaagaacctg 600 ttncaaatgc gactcttcaa gtcactggcg cttgttgggt tggagaagaaaaggacttgt 660 gtaccaagaa agaatgangg tttcttggac atagttgaaa agtgggaacccttcttatgg 720 cttcnaaaag ggagtggtct taanaatgtg atgtgac 757 <210> SEQ IDNO 178 <211> LENGTH: 185 <212> TYPE: PRT <213> ORGANISM: Zea mays <220>FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (136) <221> NAME/KEY:UNSURE <222> LOCATION: (156) <221> NAME/KEY: UNSURE <222> LOCATION:(181) <400> SEQUENCE: 178 Met Pro Val Ala Ala Ser Ala Ile Tyr Phe LeuAsn Leu Arg Gly Asp 1 5 10 15 Val Leu Ile Asn Arg Leu Tyr Arg Asp AspVal Gly Gly Asn Met Val 20 25 30 Asp Ala Phe Arg Met His Ile Met Gln ThrLys Glu Leu Gly Thr Cys 35 40 45 Pro Val Arg Gln Ile Gly Gly Cys Ser PheLeu Tyr Met Arg Ile Ser 50 55 60 Asn Val Tyr Ile Val Ile Val Val Ser SerAsn Ala Asn Val Ala Cys 65 70 75 80 Ala Phe Lys Phe Val Val Glu Ala ValAla Leu Phe Lys Ser Tyr Phe 85 90 95 Gly Gly Ala Phe Asp Glu Asp Ala IleArg Asn Asn Phe Val Leu Ile 100 105 110 Tyr Glu Leu Leu Asp Glu Ile MetAsp Phe Gly Tyr Pro Gln Asn Leu 115 120 125 Ser Pro Glu Ile Leu Lys LeuXaa Ile Thr Gln Glu Gly Val Arg Ser 130 135 140 Pro Phe Ser Ser Lys AlaLeu Lys Arg Thr Cys Xaa Lys Cys Asp Ser 145 150 155 160 Ser Ser His TrpArg Leu Leu Gly Trp Arg Arg Lys Gly Leu Val Tyr 165 170 175 Gln Glu ArgMet Xaa Val Ser Trp Thr 180 185 <210> SEQ ID NO 179 <211> LENGTH: 831<212> TYPE: DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 179gcacgaggtt taaactatcg aatcactgag ggtgtaaatc tcccattccg ggttctacca 60acgattaagg aattgggccg aacacgcatg gagataaatg ttaaagttaa gagtgttttt 120ggtgctaaga tgtttgcact tggtgttgtc gtcaaagtcc ctgttccaaa acaaacagca 180aagactagtt tccaaaccac atctgggaaa gccaaatata atgcttcaat tgattcccta 240gtgtggaaga tcaggaagtt tcctggacaa actgaggcaa ccatgagtgc cgaagttgag 300ctgatttcta caatggggga aaagaaatca tggaacaggc cgccaattca gatggaattc 360caggttccta tgtttactgc ttctgggtta cgagttcggt tccttaaggt gtgggagaag 420agcggttaca acaccgttga gtgggttcgc tacatcacaa gggctggatc atatgagata 480aggtgttagc gatgaagaaa aaagtcccag gtatttggca cggttatttc aaggattcgc 540ggagctttct ttctagtacg gcgtcatgaa tgatcataaa gcatgtatat tacatatgac 600accacttttt ctttaagcta ttactttggc aaagggtaga gcagagcaga caaattttcc 660tgggttcttg taatcgtttt gacacgggat atctagtatc ccctgagtct tgtttgtgtg 720gtagggcata ttttcatttg tatatgagaa gatggcggta gcgtttgttc ttgttgatca 780atatacatat actcattggc agttaaaaaa aaaaaaaaaa aaaaaaaaaa a 831 <210> SEQID NO 180 <211> LENGTH: 162 <212> TYPE: PRT <213> ORGANISM: Oryza sativa<400> SEQUENCE: 180 Ala Arg Gly Leu Asn Tyr Arg Ile Thr Glu Gly Val AsnLeu Pro Phe 1 5 10 15 Arg Val Leu Pro Thr Ile Lys Glu Leu Gly Arg ThrArg Met Glu Ile 20 25 30 Asn Val Lys Val Lys Ser Val Phe Gly Ala Lys MetPhe Ala Leu Gly 35 40 45 Val Val Val Lys Val Pro Val Pro Lys Gln Thr AlaLys Thr Ser Phe 50 55 60 Gln Thr Thr Ser Gly Lys Ala Lys Tyr Asn Ala SerIle Asp Ser Leu 65 70 75 80 Val Trp Lys Ile Arg Lys Phe Pro Gly Gln ThrGlu Ala Thr Met Ser 85 90 95 Ala Glu Val Glu Leu Ile Ser Thr Met Gly GluLys Lys Ser Trp Asn 100 105 110 Arg Pro Pro Ile Gln Met Glu Phe Gln ValPro Met Phe Thr Ala Ser 115 120 125 Gly Leu Arg Val Arg Phe Leu Lys ValTrp Glu Lys Ser Gly Tyr Asn 130 135 140 Thr Val Glu Trp Val Arg Tyr IleThr Arg Ala Gly Ser Tyr Glu Ile 145 150 155 160 Arg Cys <210> SEQ ID NO181 <211> LENGTH: 1776 <212> TYPE: DNA <213> ORGANISM: Glycine max <400>SEQUENCE: 181 gcacgagctg aattgtcaaa ttcacccgct accttttccc gtgtacgaacgaatccaatc 60 caccaaatcg gcggcgacgc ctccgctctg tctcgaaacc acagatctgcggcggaaccg 120 ccgtctccga ctatctccga ttgatcttcc ggtaaagtgg accggagatgccggttgccg 180 cttccgccat ttacttcctc aacctccgcg gcgacgttct catcaaccgcctctatcgcg 240 acgatgtcgg gggaaatatg gtggatgctt ttaggacgca tattatgcaaacgaaggagc 300 ttggtacttg cccagtgagg cagattggtg gatgctcttt cttttacatgaggatcagta 360 atgtctacat tgtcattgtt gtcagcaaca acgccaatgt cgcttgcgctttcaagtttg 420 ttgttgaggc tgttgcattg ttccgatcat actttggtgg agcttttgacgaggatgcaa 480 ttcgcaataa ttttgtactc atttatgagc tcctagatga aattatggactttggttacc 540 cacaaaatct ttcaccagag atcttaaagc tttatatcac tcaggaaggagtgcgttccc 600 cattttcatc aaagcccaca gatagacctg ttccaaatgc aactttacaagttactggtg 660 ctgttggttg gcggagagaa gggcttgtgt acaaaaagaa tgaggtcttcctggatattg 720 tggaaagtgt aaatcttcta atgtcttcaa aaggtagtgt tctgcgttgtgatgtcactg 780 ggaagattct catgaagtgc tttctttctg gaatgcctga tttaaagttgggtttgaatg 840 acaagattgg tcttgagaaa gaagcacaac ttaaatcccg tcctactaaaagtggtaaaa 900 gtatcgagct tgatgatgtc actttccatc aatgtgtaaa tttgacaaggttcaactcag 960 agaagacagt tagttttgta ccacctgatg gtgaatttga actaatgaagtatcgtataa 1020 ctgagggagt taatcttccg ttcaaagtat tgccaaccat caaggaacttggtcgatcac 1080 ggatagaagt gaacgttaag gtaaagagtg tttttggtgc aaaaatgtttgcacttgggg 1140 ttgtagtcaa gattcctgtt ccaaaacaaa cagcaaaaac aaatttcacagttacatctg 1200 gccgggcaaa atacaatgct tctattgatt gtttggtttg gaagattagaaaattccctg 1260 ggcagactga gtcaacctta agtgcagaag ttgaacttat ttccacaataacagaaaaga 1320 aatcttggac tagaccacca attcagatgg agtttcaggt tcccatgttcacagcatctg 1380 gtttacgtgt tcgtttcctc aaggtgtggg agaagagtgg gtacaataccgttgagtggg 1440 ttcgttacat tacaaaagct ggatcctacg agattaggtg ctagacagagtttgtattac 1500 tggatggaga atgaagcaaa atcccgaata agccgcagtt caggctagtcatcctagttt 1560 tcgtattgta ttaaacatgt aatgaaaggt tcaggagtta ctcaatgtgctgtcagtatg 1620 attttttgtg tgaaaatctg tttttaatat ttttgttcca aaaggccagcttgttattgg 1680 aacctgaagt tgagtgttct cttgtaaact ttgtattttg ttcttatatctctttttata 1740 taaaacatca tttattgtaa aaaaaaaaaa aaaaaa 1776 <210> SEQID NO 182 <211> LENGTH: 438 <212> TYPE: PRT <213> ORGANISM: Glycine max<400> SEQUENCE: 182 Met Pro Val Ala Ala Ser Ala Ile Tyr Phe Leu Asn LeuArg Gly Asp 1 5 10 15 Val Leu Ile Asn Arg Leu Tyr Arg Asp Asp Val GlyGly Asn Met Val 20 25 30 Asp Ala Phe Arg Thr His Ile Met Gln Thr Lys GluLeu Gly Thr Cys 35 40 45 Pro Val Arg Gln Ile Gly Gly Cys Ser Phe Phe TyrMet Arg Ile Ser 50 55 60 Asn Val Tyr Ile Val Ile Val Val Ser Asn Asn AlaAsn Val Ala Cys 65 70 75 80 Ala Phe Lys Phe Val Val Glu Ala Val Ala LeuPhe Arg Ser Tyr Phe 85 90 95 Gly Gly Ala Phe Asp Glu Asp Ala Ile Arg AsnAsn Phe Val Leu Ile 100 105 110 Tyr Glu Leu Leu Asp Glu Ile Met Asp PheGly Tyr Pro Gln Asn Leu 115 120 125 Ser Pro Glu Ile Leu Lys Leu Tyr IleThr Gln Glu Gly Val Arg Ser 130 135 140 Pro Phe Ser Ser Lys Pro Thr AspArg Pro Val Pro Asn Ala Thr Leu 145 150 155 160 Gln Val Thr Gly Ala ValGly Trp Arg Arg Glu Gly Leu Val Tyr Lys 165 170 175 Lys Asn Glu Val PheLeu Asp Ile Val Glu Ser Val Asn Leu Leu Met 180 185 190 Ser Ser Lys GlySer Val Leu Arg Cys Asp Val Thr Gly Lys Ile Leu 195 200 205 Met Lys CysPhe Leu Ser Gly Met Pro Asp Leu Lys Leu Gly Leu Asn 210 215 220 Asp LysIle Gly Leu Glu Lys Glu Ala Gln Leu Lys Ser Arg Pro Thr 225 230 235 240Lys Ser Gly Lys Ser Ile Glu Leu Asp Asp Val Thr Phe His Gln Cys 245 250255 Val Asn Leu Thr Arg Phe Asn Ser Glu Lys Thr Val Ser Phe Val Pro 260265 270 Pro Asp Gly Glu Phe Glu Leu Met Lys Tyr Arg Ile Thr Glu Gly Val275 280 285 Asn Leu Pro Phe Lys Val Leu Pro Thr Ile Lys Glu Leu Gly ArgSer 290 295 300 Arg Ile Glu Val Asn Val Lys Val Lys Ser Val Phe Gly AlaLys Met 305 310 315 320 Phe Ala Leu Gly Val Val Val Lys Ile Pro Val ProLys Gln Thr Ala 325 330 335 Lys Thr Asn Phe Thr Val Thr Ser Gly Arg AlaLys Tyr Asn Ala Ser 340 345 350 Ile Asp Cys Leu Val Trp Lys Ile Arg LysPhe Pro Gly Gln Thr Glu 355 360 365 Ser Thr Leu Ser Ala Glu Val Glu LeuIle Ser Thr Ile Thr Glu Lys 370 375 380 Lys Ser Trp Thr Arg Pro Pro IleGln Met Glu Phe Gln Val Pro Met 385 390 395 400 Phe Thr Ala Ser Gly LeuArg Val Arg Phe Leu Lys Val Trp Glu Lys 405 410 415 Ser Gly Tyr Asn ThrVal Glu Trp Val Arg Tyr Ile Thr Lys Ala Gly 420 425 430 Ser Tyr Glu IleArg Cys 435 <210> SEQ ID NO 183 <211> LENGTH: 554 <212> TYPE: DNA <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 183 gcacgagatt tcctggacagaccgaggcaa cgatgagtgc agaagttgaa ctgatctcta 60 caatggggga aaagaagttagcgaacaggc caccgattca gatggagttc caggttccga 120 tgttcactgc ttctggcttacgtgttcggt tcctcaaggt gtgggagaag agtggctaca 180 acaccgttga gtgggttcgctacatcacaa gggctggatc atatgaaatc aggtgttagt 240 gaccaagaaa aatggcgtgggctcattatt tcgtggattt gcagagctct ttttgtactc 300 atcatgattg atcagatggcacgtaaatta tgacccaatt cgttgcaagt ttgaacttca 360 gcagagggca gatcagagcaggcagttttc ctatgttcct tattttcatg ttgacatggg 420 atatgtattc tcccccgtgtcctgtttgtc atagggcata tttttattgt atatttggca 480 gtagcattcg ttctcgttgatcaatacacc cccttgccag ttcgtgctgt tttctcaaaa 540 aaaaaaaaaa aaaa 554<210> SEQ ID NO 184 <211> LENGTH: 78 <212> TYPE: PRT <213> ORGANISM:Triticum aestivum <400> SEQUENCE: 184 Thr Arg Phe Pro Gly Gln Thr GluAla Thr Met Ser Ala Glu Val Glu 1 5 10 15 Leu Ile Ser Thr Met Gly GluLys Lys Leu Ala Asn Arg Pro Pro Ile 20 25 30 Gln Met Glu Phe Gln Val ProMet Phe Thr Ala Ser Gly Leu Arg Val 35 40 45 Arg Phe Leu Lys Val Trp GluLys Ser Gly Tyr Asn Thr Val Glu Trp 50 55 60 Val Arg Tyr Ile Thr Arg AlaGly Ser Tyr Glu Ile Arg Cys 65 70 75 <210> SEQ ID NO 185 <211> LENGTH:3830 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 185ccacgcgtcc ggccgcggac gaatcaaaaa agcttcctcc tcccggcgtc gcctccgcgc 60gtggaggcga cgatccattc cgccccccgt aggattttgc tcctcccgat ccggacgtca 120gatctcgagt ggcggcacgc ggatctgggg gtttctctag aggcggcggc ggcggccgga 180acaggggcag atggcgctct cggggatgcg ggggctctcg gtcttcatca gcgacatacg 240caactgccac aacaaggagc aggagcgcct ccgcgtcgac aaggagctcg gcaacatccg 300cacgcggttc aaaaacgaga agggcttatc accatacgag aagaaaaaat atgtatggaa 360aatgctttac atttatatgc tgggttatga tgttgatttt gggcatatgg aaactgtctc 420gttgatatct gcaccaaagt atcctgagaa gcaggttggg tatattgtga catcttgctt 480actgaatgag aataatgatt tcttaagaat ggtaataaat acagtaagga atgacataat 540cggacgaaat gagaccttcc agtgcttagc attgacaatg gtaggtaaca ttggtggtaa 600agaattttct gagtcactag ccccagatgt ccagaaactt cttatttcaa gcagttgccg 660tcctgttgtc agaaagaagg ctgctctatg ccttctgcgg ctttatagga agaatcctga 720tgttgtgaat attgatggct gggctgaccg aatggcacaa cttctagatg agcgtgattt 780aggtgtgctg acatctgtca cgagtctttt tgtttcacta gtatccaaca atgttgaagc 840atattggaat tgtcttccta aatgtgtgag aatattggag aggttagcaa gaaatcaaga 900tattccacaa gagtacactt actatggaat tccatctcct tggcttcagg ttaagacaat 960gagagctctt cagtacttcc ctaccattga agatcccaac gcaagacgag ctttgtttga 1020ggttttacag cgcattttga tgggtactga tgttgttaaa aacgttaaca agaacaatgc 1080ctcacatgct gttctctttg aagctcttgc tctggtcatg catctggatg ctgaaaagga 1140gatgatgtca cagtgtgtgg ctcttcttgg gaagtttatt gcagtccggg agccaaacat 1200tcggtacctt ggtctggaaa acatgactag gatgctgtta gtaacagatg tgcaggatat 1260cattaaaagg caccaggctc agatcatcac atctttgaag gatccagata tcagcattag 1320gagaagggct cttgatttat tatatggcat gtgtgatgtt acaaacgcga aggaaattgt 1380ggaggagttg ttgcagtacc ttgatacagc agaatttgca atgcgtgaag agttgtctct 1440gaaggcagct attcttgcag aaaaatttgc tccgcaacta ttgtggtatg ttgatgttat 1500tcttcaactt atagacaaag ctggagattt tgtaagtgat gatatatggt atcgagtggt 1560acaatttgtt acgaacaacg aagacctaca gtcatacgct gcaactaaag cgagagaata 1620tcttgacaag cctgctttac atgagacgat ggtcaaggtg agtgcctacc ttcttgggga 1680gtatggacat cttttggccc ggagacctgg ctgtagccct aaggaattgt ttgctattat 1740aaatgataaa cttcctacag tatcggcaag tactgtagct attcttctct caacctatgc 1800caagatattg atgcacactc aacctcctga tgttggactg caacaacaaa tcctcacaat 1860atttaaaaag tacgagagct atattgatgt tgaaatacaa cagagagcag ttgaatattt 1920tgaactaagc aggaaaggtc ctgctttggc agatgtgttg gctgaaatgc caaaattccc 1980tgaacgtgag tctgcactgt tgaagaaggc tgaagacgct gaaattgaca cagcagagca 2040aagtgccata aagctacgca gtcagcaaca aacatctagt gctcttgttg tagctgatca 2100cccccctgca aatggatttg caccaccagt taataatctc actcttgtga agatgccaag 2160tcaaactgtt tctgacactc aggagagtgg agtaagttat gaagaagctc ctaagcctcc 2220tgttgaagct cccaaagaaa atgggactcc tgtcgaagtg cagaacaggg acacaaacat 2280tactgaaatc aacaatgaga ttaaagctga acctccctct acatcccatt ctacctctcc 2340cggagacctc ctcctggcag atcttttggg tcctcttgcg atagaaggcc ctcctgctgt 2400agagcaaaat cctgcccaag gattgaatgc taatcaaagt ccagctggtg acttggcact 2460tgctaccctt gatgatcagt ccaactcagt tcagccaatt gtcaacgtcg aggagaagtt 2520tcatattttg tgcacaaaag atagtggggt gctgtacgag gatccttaca tccagattgg 2580cttgaaagcg gagtggcgtg cccatcatgg ccgtcttgtt cttttcctgg gtaacaaaaa 2640tacttcagcc cttacgtcag tgcgggcgct gattttgcct cctagtcatt tgaaaatgga 2700actttcgtca gttcctgata ctattcctcc aagagcgcag gtgcaagttc cccttgaggt 2760tgcgaatctc cttgcaagta gagatgttgc ggttctagat ttctcgtatg cttttgggac 2820ttcactggtg gatgccaaac ttcgactccc tgttgtattg aataaatttt tgcagactat 2880aactcttacc cctgaagaat ttttctccca gtggaaagcg cttactgttc attcgctgaa 2940ggttcaagaa gtggttaaag gtgtgaaacc catgcctctt tctgagatgg ctaatctttt 3000catgagcctt cacttggcgg tcgctcctgg acttgataac aacccgaaca atctggttgc 3060atgcacgaca ttcttttctg aagcaacacg tgccatgctt tgtctgatac gagttgaaac 3120agatccacaa gacaggaccc aactgcggct aacagttgcc tcgggcgacc aatatttgac 3180attcgagttg aaagagttca tcaaagaaca tctggttgat atcccaagga ctcaggtcgc 3240cccacctcgc gcagcagtcc agccacagtt gcctactaca gcacctgcca catacaatga 3300ccctggtacc atgcttgctg gattgctatg agaagtaacc cgaaggaata gtgttttgat 3360gtacatgaca gttatttaat acgaggatct aacatctcca caagcatctt gctgccaaga 3420cgtacacgtg gagcaaccaa gaaagagctg gcggagtgta gtctccatga ggattctaaa 3480ccgagaaaga gcttagattc tttttgtgcg gtttgaaggc cccctccctc tccatgtagc 3540agttgccgta ttgtgtaagc gtaacttgtt gtttctcatt attatcatcc ccttttggtc 3600catttgtata acctgtatcg cccatgtctg gtggtggaca atcggtagtt tttgtgtgtt 3660aatataaggt gctaggtttt cccaccatgc cgattgtaaa gttgccctgg tgcgcattgc 3720actgcgattt gtacggaata gggaaggaaa aatttatagg ttttgctttg tgcatgttca 3780aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag 3830 <210> SEQ IDNO 186 <211> LENGTH: 1046 <212> TYPE: PRT <213> ORGANISM: Zea mays <400>SEQUENCE: 186 Met Ala Leu Ser Gly Met Arg Gly Leu Ser Val Phe Ile SerAsp Ile 1 5 10 15 Arg Asn Cys His Asn Lys Glu Gln Glu Arg Leu Arg ValAsp Lys Glu 20 25 30 Leu Gly Asn Ile Arg Thr Arg Phe Lys Asn Glu Lys GlyLeu Ser Pro 35 40 45 Tyr Glu Lys Lys Lys Tyr Val Trp Lys Met Leu Tyr IleTyr Met Leu 50 55 60 Gly Tyr Asp Val Asp Phe Gly His Met Glu Thr Val SerLeu Ile Ser 65 70 75 80 Ala Pro Lys Tyr Pro Glu Lys Gln Val Gly Tyr IleVal Thr Ser Cys 85 90 95 Leu Leu Asn Glu Asn Asn Asp Phe Leu Arg Met ValIle Asn Thr Val 100 105 110 Arg Asn Asp Ile Ile Gly Arg Asn Glu Thr PheGln Cys Leu Ala Leu 115 120 125 Thr Met Val Gly Asn Ile Gly Gly Lys GluPhe Ser Glu Ser Leu Ala 130 135 140 Pro Asp Val Gln Lys Leu Leu Ile SerSer Ser Cys Arg Pro Val Val 145 150 155 160 Arg Lys Lys Ala Ala Leu CysLeu Leu Arg Leu Tyr Arg Lys Asn Pro 165 170 175 Asp Val Val Asn Ile AspGly Trp Ala Asp Arg Met Ala Gln Leu Leu 180 185 190 Asp Glu Arg Asp LeuGly Val Leu Thr Ser Val Thr Ser Leu Phe Val 195 200 205 Ser Leu Val SerAsn Asn Val Glu Ala Tyr Trp Asn Cys Leu Pro Lys 210 215 220 Cys Val ArgIle Leu Glu Arg Leu Ala Arg Asn Gln Asp Ile Pro Gln 225 230 235 240 GluTyr Thr Tyr Tyr Gly Ile Pro Ser Pro Trp Leu Gln Val Lys Thr 245 250 255Met Arg Ala Leu Gln Tyr Phe Pro Thr Ile Glu Asp Pro Asn Ala Arg 260 265270 Arg Ala Leu Phe Glu Val Leu Gln Arg Ile Leu Met Gly Thr Asp Val 275280 285 Val Lys Asn Val Asn Lys Asn Asn Ala Ser His Ala Val Leu Phe Glu290 295 300 Ala Leu Ala Leu Val Met His Leu Asp Ala Glu Lys Glu Met MetSer 305 310 315 320 Gln Cys Val Ala Leu Leu Gly Lys Phe Ile Ala Val ArgGlu Pro Asn 325 330 335 Ile Arg Tyr Leu Gly Leu Glu Asn Met Thr Arg MetLeu Leu Val Thr 340 345 350 Asp Val Gln Asp Ile Ile Lys Arg His Gln AlaGln Ile Ile Thr Ser 355 360 365 Leu Lys Asp Pro Asp Ile Ser Ile Arg ArgArg Ala Leu Asp Leu Leu 370 375 380 Tyr Gly Met Cys Asp Val Thr Asn AlaLys Glu Ile Val Glu Glu Leu 385 390 395 400 Leu Gln Tyr Leu Asp Thr AlaGlu Phe Ala Met Arg Glu Glu Leu Ser 405 410 415 Leu Lys Ala Ala Ile LeuAla Glu Lys Phe Ala Pro Gln Leu Leu Trp 420 425 430 Tyr Val Asp Val IleLeu Gln Leu Ile Asp Lys Ala Gly Asp Phe Val 435 440 445 Ser Asp Asp IleTrp Tyr Arg Val Val Gln Phe Val Thr Asn Asn Glu 450 455 460 Asp Leu GlnSer Tyr Ala Ala Thr Lys Ala Arg Glu Tyr Leu Asp Lys 465 470 475 480 ProAla Leu His Glu Thr Met Val Lys Val Ser Ala Tyr Leu Leu Gly 485 490 495Glu Tyr Gly His Leu Leu Ala Arg Arg Pro Gly Cys Ser Pro Lys Glu 500 505510 Leu Phe Ala Ile Ile Asn Asp Lys Leu Pro Thr Val Ser Ala Ser Thr 515520 525 Val Ala Ile Leu Leu Ser Thr Tyr Ala Lys Ile Leu Met His Thr Gln530 535 540 Pro Pro Asp Val Gly Leu Gln Gln Gln Ile Leu Thr Ile Phe LysLys 545 550 555 560 Tyr Glu Ser Tyr Ile Asp Val Glu Ile Gln Gln Arg AlaVal Glu Tyr 565 570 575 Phe Glu Leu Ser Arg Lys Gly Pro Ala Leu Ala AspVal Leu Ala Glu 580 585 590 Met Pro Lys Phe Pro Glu Arg Glu Ser Ala LeuLeu Lys Lys Ala Glu 595 600 605 Asp Ala Glu Ile Asp Thr Ala Glu Gln SerAla Ile Lys Leu Arg Ser 610 615 620 Gln Gln Gln Thr Ser Ser Ala Leu ValVal Ala Asp His Pro Pro Ala 625 630 635 640 Asn Gly Phe Ala Pro Pro ValAsn Asn Leu Thr Leu Val Lys Met Pro 645 650 655 Ser Gln Thr Val Ser AspThr Gln Glu Ser Gly Val Ser Tyr Glu Glu 660 665 670 Ala Pro Lys Pro ProVal Glu Ala Pro Lys Glu Asn Gly Thr Pro Val 675 680 685 Glu Val Gln AsnArg Asp Thr Asn Ile Thr Glu Ile Asn Asn Glu Ile 690 695 700 Lys Ala GluPro Pro Ser Thr Ser His Ser Thr Ser Pro Gly Asp Leu 705 710 715 720 LeuLeu Ala Asp Leu Leu Gly Pro Leu Ala Ile Glu Gly Pro Pro Ala 725 730 735Val Glu Gln Asn Pro Ala Gln Gly Leu Asn Ala Asn Gln Ser Pro Ala 740 745750 Gly Asp Leu Ala Leu Ala Thr Leu Asp Asp Gln Ser Asn Ser Val Gln 755760 765 Pro Ile Val Asn Val Glu Glu Lys Phe His Ile Leu Cys Thr Lys Asp770 775 780 Ser Gly Val Leu Tyr Glu Asp Pro Tyr Ile Gln Ile Gly Leu LysAla 785 790 795 800 Glu Trp Arg Ala His His Gly Arg Leu Val Leu Phe LeuGly Asn Lys 805 810 815 Asn Thr Ser Ala Leu Thr Ser Val Arg Ala Leu IleLeu Pro Pro Ser 820 825 830 His Leu Lys Met Glu Leu Ser Ser Val Pro AspThr Ile Pro Pro Arg 835 840 845 Ala Gln Val Gln Val Pro Leu Glu Val AlaAsn Leu Leu Ala Ser Arg 850 855 860 Asp Val Ala Val Leu Asp Phe Ser TyrAla Phe Gly Thr Ser Leu Val 865 870 875 880 Asp Ala Lys Leu Arg Leu ProVal Val Leu Asn Lys Phe Leu Gln Thr 885 890 895 Ile Thr Leu Thr Pro GluGlu Phe Phe Ser Gln Trp Lys Ala Leu Thr 900 905 910 Val His Ser Leu LysVal Gln Glu Val Val Lys Gly Val Lys Pro Met 915 920 925 Pro Leu Ser GluMet Ala Asn Leu Phe Met Ser Leu His Leu Ala Val 930 935 940 Ala Pro GlyLeu Asp Asn Asn Pro Asn Asn Leu Val Ala Cys Thr Thr 945 950 955 960 PhePhe Ser Glu Ala Thr Arg Ala Met Leu Cys Leu Ile Arg Val Glu 965 970 975Thr Asp Pro Gln Asp Arg Thr Gln Leu Arg Leu Thr Val Ala Ser Gly 980 985990 Asp Gln Tyr Leu Thr Phe Glu Leu Lys Glu Phe Ile Lys Glu His Leu 9951000 1005 Val Asp Ile Pro Arg Thr Gln Val Ala Pro Pro Arg Ala Ala ValGln 1010 1015 1020 Pro Gln Leu Pro Thr Thr Ala Pro Ala Thr Tyr Asn AspPro Gly Thr 1025 1030 1035 1040 Met Leu Ala Gly Leu Leu 1045 <210> SEQID NO 187 <211> LENGTH: 1527 <212> TYPE: DNA <213> ORGANISM: Glycine max<400> SEQUENCE: 187 ggatgaggca gatcaaagat tatcccagga aaatgggactttgagtatag tagattctca 60 acctccctct gcagatctcc ttggcgatct cttgggtccactggctattg aaggccctcc 120 cagtagcagt gtccacctgc agccaagctc aaattcaggagtggaaggaa ctgtagttga 180 agccacagcc atagtacctg ctggagaaca ggccaattctgtacagccaa ttggaaatat 240 tgctgaaaga tttcatgctt tgtgcgtgaa ggatagtggtgttctatatg aggatcctta 300 tattcagatt ggcattaaag cagaatggag agctcatcaagggcatcttg ttcttttctt 360 gggaaacaag aatacttctc ctcttgtctc tgttcaagctttaatattgc ctcccacgca 420 tttgaagatg gagctctctc tagtaccaga aactatacctccccgggcac aagtgcaatg 480 cccacttgag gtcattaatc tccacccaag cagggatgttgccgttcttg acttctccta 540 caagtttggt aacgatatgg tcaatgtcaa gcttcgcctacctgctgtcc taaataaatt 600 tcttcagcct ataactatat ctgctgaaga gtttttccctcaatggagat cacttcctgg 660 accacctttg aaacttcaag aagtggtcag aggtgttagaccccttccac tgctcgaaat 720 ggcaaactta ttcaatagtt accatttgac agtttgcccagggcttgatc ccaatcctaa 780 caatcttgtt gtgagtacaa cattttattc agaaagtacaagggcaatgc tttgtttagt 840 aagaattgaa acagatccag cagacagaac ccagctgcgaatgacggttg cttcagggga 900 cccaacacta acatttgaga tgaaggagtt catcaaggaccaattagtca gcattcccgc 960 catagcaacc cgtgtaccaa cacaaccagc tccgacatctccaccactgg cccaaccagg 1020 cagtgctcct gcagcattga cggatcctgg ggcaatgctggctgctcttt tatgacaata 1080 tagaggtgtt cattaattat tacacatttg taaacctgatctacacagcc taacccatgt 1140 ctccctcaaa gctagtgagg tgaatagaca aaatcatgatgcaagggctg gcggaattaa 1200 gagtgagctt gtttaccagc tgagtgtttg agcagggatttctgggttgg tgggcatttt 1260 tgagattctt tctgcacaaa gtgattgtcc tttaagcaagattttatatc agttgggcga 1320 gggttttcag ttttctcata tttttcccct tcccagtttcactggattgt ttgtggtgca 1380 cctgatagca tgtaagatat tgcctttgat agaaacaattgtaagtttga aattgatact 1440 ttgacgttac atacaataat agcttgtgga gtaaggacatacgcattttt gctatcaaat 1500 gctatctagg gttcaaactt caaaaaa 1527 <210> SEQID NO 188 <211> LENGTH: 357 <212> TYPE: PRT <213> ORGANISM: Glycine max<400> SEQUENCE: 188 Asp Glu Ala Asp Gln Arg Leu Ser Gln Glu Asn Gly ThrLeu Ser Ile 1 5 10 15 Val Asp Ser Gln Pro Pro Ser Ala Asp Leu Leu GlyAsp Leu Leu Gly 20 25 30 Pro Leu Ala Ile Glu Gly Pro Pro Ser Ser Ser ValHis Leu Gln Pro 35 40 45 Ser Ser Asn Ser Gly Val Glu Gly Thr Val Val GluAla Thr Ala Ile 50 55 60 Val Pro Ala Gly Glu Gln Ala Asn Ser Val Gln ProIle Gly Asn Ile 65 70 75 80 Ala Glu Arg Phe His Ala Leu Cys Val Lys AspSer Gly Val Leu Tyr 85 90 95 Glu Asp Pro Tyr Ile Gln Ile Gly Ile Lys AlaGlu Trp Arg Ala His 100 105 110 Gln Gly His Leu Val Leu Phe Leu Gly AsnLys Asn Thr Ser Pro Leu 115 120 125 Val Ser Val Gln Ala Leu Ile Leu ProPro Thr His Leu Lys Met Glu 130 135 140 Leu Ser Leu Val Pro Glu Thr IlePro Pro Arg Ala Gln Val Gln Cys 145 150 155 160 Pro Leu Glu Val Ile AsnLeu His Pro Ser Arg Asp Val Ala Val Leu 165 170 175 Asp Phe Ser Tyr LysPhe Gly Asn Asp Met Val Asn Val Lys Leu Arg 180 185 190 Leu Pro Ala ValLeu Asn Lys Phe Leu Gln Pro Ile Thr Ile Ser Ala 195 200 205 Glu Glu PhePhe Pro Gln Trp Arg Ser Leu Pro Gly Pro Pro Leu Lys 210 215 220 Leu GlnGlu Val Val Arg Gly Val Arg Pro Leu Pro Leu Leu Glu Met 225 230 235 240Ala Asn Leu Phe Asn Ser Tyr His Leu Thr Val Cys Pro Gly Leu Asp 245 250255 Pro Asn Pro Asn Asn Leu Val Val Ser Thr Thr Phe Tyr Ser Glu Ser 260265 270 Thr Arg Ala Met Leu Cys Leu Val Arg Ile Glu Thr Asp Pro Ala Asp275 280 285 Arg Thr Gln Leu Arg Met Thr Val Ala Ser Gly Asp Pro Thr LeuThr 290 295 300 Phe Glu Met Lys Glu Phe Ile Lys Asp Gln Leu Val Ser IlePro Ala 305 310 315 320 Ile Ala Thr Arg Val Pro Thr Gln Pro Ala Pro ThrSer Pro Pro Leu 325 330 335 Ala Gln Pro Gly Ser Ala Pro Ala Ala Leu ThrAsp Pro Gly Ala Met 340 345 350 Leu Ala Ala Leu Leu 355 <210> SEQ ID NO189 <211> LENGTH: 3202 <212> TYPE: DNA <213> ORGANISM: Zea mays <400>SEQUENCE: 189 ccacgcgtcc gatcgcaagc gtccacaccg tgaccaccgg cgccgctgcggcgtccggag 60 caggcggcga gcgtcgtcca cagggtaggc tcggctcgct gaggcggacgagatgagcgg 120 gcacgactcc aagtacttct ctaccaccaa gaagggggag atccccgagctcaaggagga 180 gctcaactcc cagtataagg acaagagaaa agatgctgtc aagaaagtgattgctgctat 240 gactgtagga aaggatgtct catcattgtt cactgatgtt gtgaactgcatgcagactga 300 gaacttggag ctcaagaaac tagtatattt gtatctcatc aactatgctaaaagtcaacc 360 tgatcttgcc attcttgctg tgaacacatt tgttaaggat tcacaagacccaaacccatt 420 gattcgtgct ttggctgtta ggacaatggg ttgtatccgc gtggacaaaatcacagagta 480 tctctgtgat ccacttcaaa gatgcctcaa ggatgacgat ccgtatgtacggaagactgc 540 agctatttgc gttgctaaac tttatgatat aaacgctgag ctagtagaggacagaggatt 600 tctggaggcc cttaaggact taatatctga caataatcct atggttgttgcaaatgctgt 660 tgctgctctg gcagagattc aagatagtag tgttcagcca atctttgaaatcaccagcca 720 tacactgtca aagcttctga cagctttgaa tgaatgcaca gagtggggccaagttttcat 780 tttggattct ttgtcaagat ataaagcagc agatgccagg gaagctgaaaacatagtgga 840 acgagttaca ccccgtctcc aacatgcaaa ttgtgcggtt gttctttctgctgtcaagat 900 aatccttcta caaatggagc tcattacgag cacggatgta gtcaggaatctctgcaagaa 960 aatggctccc cctcttgtta ctcttttgtc agcagagcct gaaattcagtatgtagcctt 1020 gaggaacatt aatctgatag ttcaaaagag gcctacaata ctcgctcatgagattaaggt 1080 tttcttttgc aagtacaatg accctatata tgttaagatg gaaaagctggagattatgat 1140 aaagcttgcc tcagatcgaa atatagatca ggtgacttct ctctgttcaagcgttgacag 1200 agcttcttgt aggtgctctt ggaattcaag gagtatgcca cagaggttgatgttgatttt 1260 gtgcggaaag ctgttcgtgc gattgggaga tgtgcaatta aattggagagagctgctgaa 1320 aggtgcatca cgcgttttgc tcgagctgat taagataaaa gttaattatgttgttcagga 1380 agctataatt gttatcaaag acatcttcag acgctaccct aatacgtatgagtctatcat 1440 tgctacactt tgtgaaagtc tggacacttt agatgaacca gaggctaaggcatccatgat 1500 ttggataatt ggagaatatg ctgaaagaat tgacaatgca gatgaacttcttgagagctt 1560 cttggaaaca ttccctgaag aaccagcatt agttcaactg cagctgctaaccgctactgt 1620 taaattgttt cttaagaagc caacagaggg gccacaacag atgattcaggctgttctcaa 1680 taatgcaaca gttgaaacag ataatcctga tttgagggac agagcttacatatattggcg 1740 acttctgtct actgatcctg aggctgcgaa agatgttgtt ttggcggagaaacctgtgat 1800 cagtgatgac tccaaccagc ttgactcatc acttcttgat gagctgctagcaaacatttc 1860 taccctttca tcagtttatc ataagccccc agaatcattt gtcagccgtgttaaggctgc 1920 tcctagggcc gatgatgaag agtttgctga tacagctgaa acagggtactcggagtcccc 1980 atcccagggt gttgatgggg catcaccttc atctagtgct ggtacttcttctaatgtacc 2040 cgtgaagcag ctggcggttg catccccacc tgcaatgcca gaccttttaggtgatttgat 2100 gggtatagat aatgctattg ttcctgttga tgaacctgca gcaccttccggccctccact 2160 acctgtctta ctgccttcga ctacaggtca aggactgcaa attagtgcacaactaacacg 2220 gcgtgatggc cagatatact atgacatatc ttttgagaat ggcacccaaggtgtcctaga 2280 tggatttatg attcagttta acaagaacac atttggtctt gctgctggtgaagcacttca 2340 ggttactcca ctgcaaccag gccaatcaac aaggacactt ttacaaatgaccccgttcca 2400 gaatatcagc cctggtgcac caaactcgct actacaggtt gctgtgaaaaataatcagca 2460 gccagtgtgg tacttcaatg acaaaattcc gctgcatgtt ttctttggtgaagatggaaa 2520 aatggaacga gctggttttc ttgaggcctg gaaatctttg cctgatgacaatgaatttac 2580 aaaagaattc ccgggctctg tcatcagcag catagatgct actgttgagcgcctcgtagc 2640 atcaaatgtg ttcttcatag ccaagcggaa aaatgcgaac atggatgttctgtatctctc 2700 cgcgaagatg ccccgtggaa tccccttcct tatagaggtt acagccgtggttggtgttcc 2760 tggtgtgaag tgtgcagtca aaacaccaaa tagggagatg gttcctctcttctttgaagc 2820 tatggaggct ctcaccaagt gacgacaaac ttaatggatc tggatcggtggttctacaaa 2880 aaaaatacca gtcgatgagc tgctataggt gtttggacgt ggcattttatttttcacgga 2940 agctggtgtg aattgtagtt tttttggtat tagattacag tatttaaactgctagtttcc 3000 tggttccaaa gttttttcac cagaaatact ggaggtgttc atgccttgcatgatttgtac 3060 atcttaccat gttgtatgaa gcgatgaaat tgtaggtgac gaaaaagtagaataagcata 3120 gattaactgg tacgattgtg gattgttatt aatttccctg caacccaatcactattttaa 3180 tggtgatgta tctatatttt tc 3202 <210> SEQ ID NO 190 <211>LENGTH: 909 <212> TYPE: PRT <213> ORGANISM: Zea mays <400> SEQUENCE: 190Met Ser Gly His Asp Ser Lys Tyr Phe Ser Thr Thr Lys Lys Gly Glu 1 5 1015 Ile Pro Glu Leu Lys Glu Glu Leu Asn Ser Gln Tyr Lys Asp Lys Arg 20 2530 Lys Asp Ala Val Lys Lys Val Ile Ala Ala Met Thr Val Gly Lys Asp 35 4045 Val Ser Ser Leu Phe Thr Asp Val Val Asn Cys Met Gln Thr Glu Asn 50 5560 Leu Glu Leu Lys Lys Leu Val Tyr Leu Tyr Leu Ile Asn Tyr Ala Lys 65 7075 80 Ser Gln Pro Asp Leu Ala Ile Leu Ala Val Asn Thr Phe Val Lys Asp 8590 95 Ser Gln Asp Pro Asn Pro Leu Ile Arg Ala Leu Ala Val Arg Thr Met100 105 110 Gly Cys Ile Arg Val Asp Lys Ile Thr Glu Tyr Leu Cys Asp ProLeu 115 120 125 Gln Arg Cys Leu Lys Asp Asp Asp Pro Tyr Val Arg Lys ThrAla Ala 130 135 140 Ile Cys Val Ala Lys Leu Tyr Asp Ile Asn Ala Glu LeuVal Glu Asp 145 150 155 160 Arg Gly Phe Leu Glu Ala Leu Lys Asp Leu IleSer Asp Asn Asn Pro 165 170 175 Met Val Val Ala Asn Ala Val Ala Ala LeuAla Glu Ile Gln Asp Ser 180 185 190 Ser Val Gln Pro Ile Phe Glu Ile ThrSer His Thr Leu Ser Lys Leu 195 200 205 Leu Thr Ala Leu Asn Glu Cys ThrGlu Trp Gly Gln Val Phe Ile Leu 210 215 220 Asp Ser Leu Ser Arg Tyr LysAla Ala Asp Ala Arg Glu Ala Glu Asn 225 230 235 240 Ile Val Glu Arg ValThr Pro Arg Leu Gln His Ala Asn Cys Ala Val 245 250 255 Val Leu Ser AlaVal Lys Ile Ile Leu Leu Gln Met Glu Leu Ile Thr 260 265 270 Ser Thr AspVal Val Arg Asn Leu Cys Lys Lys Met Ala Pro Pro Leu 275 280 285 Val ThrLeu Leu Ser Ala Glu Pro Glu Ile Gln Tyr Val Ala Leu Arg 290 295 300 AsnIle Asn Leu Ile Val Gln Lys Arg Pro Thr Ile Leu Ala His Glu 305 310 315320 Ile Lys Val Phe Phe Cys Lys Tyr Asn Asp Pro Ile Tyr Val Lys Met 325330 335 Glu Lys Leu Glu Ile Met Ile Lys Leu Ala Ser Asp Arg Asn Ile Asp340 345 350 Gln Val Thr Ser Leu Cys Ser Ser Val Asp Arg Ala Ser Cys ArgCys 355 360 365 Ser Trp Asn Ser Arg Ser Met Pro Gln Arg Leu Met Leu IleLeu Cys 370 375 380 Gly Lys Leu Phe Val Arg Leu Gly Asp Val Gln Leu AsnTrp Arg Glu 385 390 395 400 Leu Leu Lys Gly Ala Ser Arg Val Leu Leu GluLeu Ile Lys Ile Lys 405 410 415 Val Asn Tyr Val Val Gln Glu Ala Ile IleVal Ile Lys Asp Ile Phe 420 425 430 Arg Arg Tyr Pro Asn Thr Tyr Glu SerIle Ile Ala Thr Leu Cys Glu 435 440 445 Ser Leu Asp Thr Leu Asp Glu ProGlu Ala Lys Ala Ser Met Ile Trp 450 455 460 Ile Ile Gly Glu Tyr Ala GluArg Ile Asp Asn Ala Asp Glu Leu Leu 465 470 475 480 Glu Ser Phe Leu GluThr Phe Pro Glu Glu Pro Ala Leu Val Gln Leu 485 490 495 Gln Leu Leu ThrAla Thr Val Lys Leu Phe Leu Lys Lys Pro Thr Glu 500 505 510 Gly Pro GlnGln Met Ile Gln Ala Val Leu Asn Asn Ala Thr Val Glu 515 520 525 Thr AspAsn Pro Asp Leu Arg Asp Arg Ala Tyr Ile Tyr Trp Arg Leu 530 535 540 LeuSer Thr Asp Pro Glu Ala Ala Lys Asp Val Val Leu Ala Glu Lys 545 550 555560 Pro Val Ile Ser Asp Asp Ser Asn Gln Leu Asp Ser Ser Leu Leu Asp 565570 575 Glu Leu Leu Ala Asn Ile Ser Thr Leu Ser Ser Val Tyr His Lys Pro580 585 590 Pro Glu Ser Phe Val Ser Arg Val Lys Ala Ala Pro Arg Ala AspAsp 595 600 605 Glu Glu Phe Ala Asp Thr Ala Glu Thr Gly Tyr Ser Glu SerPro Ser 610 615 620 Gln Gly Val Asp Gly Ala Ser Pro Ser Ser Ser Ala GlyThr Ser Ser 625 630 635 640 Asn Val Pro Val Lys Gln Leu Ala Val Ala SerPro Pro Ala Met Pro 645 650 655 Asp Leu Leu Gly Asp Leu Met Gly Ile AspAsn Ala Ile Val Pro Val 660 665 670 Asp Glu Pro Ala Ala Pro Ser Gly ProPro Leu Pro Val Leu Leu Pro 675 680 685 Ser Thr Thr Gly Gln Gly Leu GlnIle Ser Ala Gln Leu Thr Arg Arg 690 695 700 Asp Gly Gln Ile Tyr Tyr AspIle Ser Phe Glu Asn Gly Thr Gln Gly 705 710 715 720 Val Leu Asp Gly PheMet Ile Gln Phe Asn Lys Asn Thr Phe Gly Leu 725 730 735 Ala Ala Gly GluAla Leu Gln Val Thr Pro Leu Gln Pro Gly Gln Ser 740 745 750 Thr Arg ThrLeu Leu Gln Met Thr Pro Phe Gln Asn Ile Ser Pro Gly 755 760 765 Ala ProAsn Ser Leu Leu Gln Val Ala Val Lys Asn Asn Gln Gln Pro 770 775 780 ValTrp Tyr Phe Asn Asp Lys Ile Pro Leu His Val Phe Phe Gly Glu 785 790 795800 Asp Gly Lys Met Glu Arg Ala Gly Phe Leu Glu Ala Trp Lys Ser Leu 805810 815 Pro Asp Asp Asn Glu Phe Thr Lys Glu Phe Pro Gly Ser Val Ile Ser820 825 830 Ser Ile Asp Ala Thr Val Glu Arg Leu Val Ala Ser Asn Val PhePhe 835 840 845 Ile Ala Lys Arg Lys Asn Ala Asn Met Asp Val Leu Tyr LeuSer Ala 850 855 860 Lys Met Pro Arg Gly Ile Pro Phe Leu Ile Glu Val ThrAla Val Val 865 870 875 880 Gly Val Pro Gly Val Lys Cys Ala Val Lys ThrPro Asn Arg Glu Met 885 890 895 Val Pro Leu Phe Phe Glu Ala Met Glu AlaLeu Thr Lys 900 905 <210> SEQ ID NO 191 <211> LENGTH: 2258 <212> TYPE:DNA <213> ORGANISM: Oryza sativa <400> SEQUENCE: 191 gcaccaggcaacatgcaaat tgtgctgttg ttctctctgc tgtaaagata atccttttac 60 aaatggagctcattactagc acagatgttg tccgtaatct ctgtaagaaa atggcacccc 120 ctcttgttacgcttttgtca gcagaacctg agattcagta tgtagcgtta aggaacatta 180 atctgattgttcagaaaagg cctactatac ttgcacatga aattaaggtc ttcttttgca 240 agtataatgacccaatatat gttaagatgg aaaagctgga gattatgata aagcttgcct 300 ctgacagaaacatagatcag gttctattgg aattcaaaga gtatgccaca gaggtggatg 360 ttgactttgtccgcaaagct gttcgtgcta ttggaagatg tgcaattaag ttggagagag 420 ctgctgaaaggtgtatcagt gttttgcttg agctgattaa gataaaggtt aattatgttg 480 tgcaggaggctataattgtt attaaggaca tcttcaggcg ctaccctaat acgtatgagt 540 cgatcatcgcaacactctgt gaaagcctag acaccttaga tgaaccagag gctaaggcat 600 caatgatttggattattgga gaatatgctg aaaggattga caatgctgac gaacttcttg 660 agagcttcttggaaacattc ccagaagaac cagcattagt tcaattgcag ttactaacgg 720 caactgttaagttgttcctt aaaaagccaa ctgaggggcc tcaacagatg atacaggctg 780 ttctcaataatgcaacagtt gaaacagaca atcctgattt gcgcgaccga gcttatatat 840 actggcgacttctttctact gatcctgagg cagctaaaga tgtagttttg gcagagaaac 900 ctgtgatcagcgatgattcc aaccagcttg attcttctct cctagatgat ctgctagcca 960 atatttctaccctttcatca gtttatcaca agcctccaga agcatttgtt agccgcgtta 1020 aaacagctcctagggctgat gatgaggagt ttgctgatac agctgaaaca ggatattcgg 1080 agtcaccatctcagggtgtt gatggggcat caccttcctc tagtgctggc acttcttcta 1140 atgttccagtgaagcagcca gcagcaccag ctgctcctgc tccaatgcca gacctccttg 1200 gtgatttgatgggtatggat aactccattg ttcctgttga tgaaccaaca gcaccttcag 1260 gccctccactacctgttttg ttgccatcaa ccactggcca aggactgcag atcagcgcac 1320 aactagtgcggcgtgatggc caaatattct atgatatatc ttttgataat ggcactcaaa 1380 ctgtgctagatggattcatg attcagttta acaaaaatac ctttggcctt gcagccggtg 1440 gtgcacttcaggtctctcca ctgcaacctg ggacctcggc caggacgctg ctacctatgg 1500 tggcattccagaatctctct cctggagcgc caagctcact gctgcaggtt gcggtgaaga 1560 ataatcagcaacctgtgtgg tacttcaatg acaaaatccc tatgcatgcc ttctttggtg 1620 aagatggcaaaatggaacga acaagttttc ttgaggcctg gaaatcttta cctgatgaca 1680 acgaattttcgaaagagttc ccctcttctg tcgtcagcag catagatgcg accgttgagc 1740 accttgcagcatcaaatgtg ttctttatcg ccaagaggaa aaactcaaac aaggatgttc 1800 tgtacatgtctgcaaagatt ccgcgtggaa tccccttcct gatagagctt actgctgcag 1860 tcggtgttcctggcgtgaag tgtgcggtca aaactccaaa caaggagatg gtggctctct 1920 tcttcgaagccatggagtct cttctcaagt gatacaaaat tgaaggatca ttgttccttc 1980 caaattgatcagttcatgag ctattgtagg tttggatgcg gcgttgtttc acaggagctg 2040 gtgtgaattgtatttgttgt tctttgtatt agattactgt atttaaactg ctagtttcct 2100 ggtttcaaagttttttcacg acgaacactc gacatggcca ttttgtctgc gtaatttgta 2160 tatctcggacctgaacctct tttcatgcca tagcatatga aatataagta gcttgcaaac 2220 tcgtaaaaaaaaaaaaaaaa aaaaaaaaaa aaaaaaaa 2258 <210> SEQ ID NO 192 <211> LENGTH:649 <212> TYPE: PRT <213> ORGANISM: Oryza sativa <400> SEQUENCE: 192 ThrArg Gln His Ala Asn Cys Ala Val Val Leu Ser Ala Val Lys Ile 1 5 10 15Ile Leu Leu Gln Met Glu Leu Ile Thr Ser Thr Asp Val Val Arg Asn 20 25 30Leu Cys Lys Lys Met Ala Pro Pro Leu Val Thr Leu Leu Ser Ala Glu 35 40 45Pro Glu Ile Gln Tyr Val Ala Leu Arg Asn Ile Asn Leu Ile Val Gln 50 55 60Lys Arg Pro Thr Ile Leu Ala His Glu Ile Lys Val Phe Phe Cys Lys 65 70 7580 Tyr Asn Asp Pro Ile Tyr Val Lys Met Glu Lys Leu Glu Ile Met Ile 85 9095 Lys Leu Ala Ser Asp Arg Asn Ile Asp Gln Val Leu Leu Glu Phe Lys 100105 110 Glu Tyr Ala Thr Glu Val Asp Val Asp Phe Val Arg Lys Ala Val Arg115 120 125 Ala Ile Gly Arg Cys Ala Ile Lys Leu Glu Arg Ala Ala Glu ArgCys 130 135 140 Ile Ser Val Leu Leu Glu Leu Ile Lys Ile Lys Val Asn TyrVal Val 145 150 155 160 Gln Glu Ala Ile Ile Val Ile Lys Asp Ile Phe ArgArg Tyr Pro Asn 165 170 175 Thr Tyr Glu Ser Ile Ile Ala Thr Leu Cys GluSer Leu Asp Thr Leu 180 185 190 Asp Glu Pro Glu Ala Lys Ala Ser Met IleTrp Ile Ile Gly Glu Tyr 195 200 205 Ala Glu Arg Ile Asp Asn Ala Asp GluLeu Leu Glu Ser Phe Leu Glu 210 215 220 Thr Phe Pro Glu Glu Pro Ala LeuVal Gln Leu Gln Leu Leu Thr Ala 225 230 235 240 Thr Val Lys Leu Phe LeuLys Lys Pro Thr Glu Gly Pro Gln Gln Met 245 250 255 Ile Gln Ala Val LeuAsn Asn Ala Thr Val Glu Thr Asp Asn Pro Asp 260 265 270 Leu Arg Asp ArgAla Tyr Ile Tyr Trp Arg Leu Leu Ser Thr Asp Pro 275 280 285 Glu Ala AlaLys Asp Val Val Leu Ala Glu Lys Pro Val Ile Ser Asp 290 295 300 Asp SerAsn Gln Leu Asp Ser Ser Leu Leu Asp Asp Leu Leu Ala Asn 305 310 315 320Ile Ser Thr Leu Ser Ser Val Tyr His Lys Pro Pro Glu Ala Phe Val 325 330335 Ser Arg Val Lys Thr Ala Pro Arg Ala Asp Asp Glu Glu Phe Ala Asp 340345 350 Thr Ala Glu Thr Gly Tyr Ser Glu Ser Pro Ser Gln Gly Val Asp Gly355 360 365 Ala Ser Pro Ser Ser Ser Ala Gly Thr Ser Ser Asn Val Pro ValLys 370 375 380 Gln Pro Ala Ala Pro Ala Ala Pro Ala Pro Met Pro Asp LeuLeu Gly 385 390 395 400 Asp Leu Met Gly Met Asp Asn Ser Ile Val Pro ValAsp Glu Pro Thr 405 410 415 Ala Pro Ser Gly Pro Pro Leu Pro Val Leu LeuPro Ser Thr Thr Gly 420 425 430 Gln Gly Leu Gln Ile Ser Ala Gln Leu ValArg Arg Asp Gly Gln Ile 435 440 445 Phe Tyr Asp Ile Ser Phe Asp Asn GlyThr Gln Thr Val Leu Asp Gly 450 455 460 Phe Met Ile Gln Phe Asn Lys AsnThr Phe Gly Leu Ala Ala Gly Gly 465 470 475 480 Ala Leu Gln Val Ser ProLeu Gln Pro Gly Thr Ser Ala Arg Thr Leu 485 490 495 Leu Pro Met Val AlaPhe Gln Asn Leu Ser Pro Gly Ala Pro Ser Ser 500 505 510 Leu Leu Gln ValAla Val Lys Asn Asn Gln Gln Pro Val Trp Tyr Phe 515 520 525 Asn Asp LysIle Pro Met His Ala Phe Phe Gly Glu Asp Gly Lys Met 530 535 540 Glu ArgThr Ser Phe Leu Glu Ala Trp Lys Ser Leu Pro Asp Asp Asn 545 550 555 560Glu Phe Ser Lys Glu Phe Pro Ser Ser Val Val Ser Ser Ile Asp Ala 565 570575 Thr Val Glu His Leu Ala Ala Ser Asn Val Phe Phe Ile Ala Lys Arg 580585 590 Lys Asn Ser Asn Lys Asp Val Leu Tyr Met Ser Ala Lys Ile Pro Arg595 600 605 Gly Ile Pro Phe Leu Ile Glu Leu Thr Ala Ala Val Gly Val ProGly 610 615 620 Val Lys Cys Ala Val Lys Thr Pro Asn Lys Glu Met Val AlaLeu Phe 625 630 635 640 Phe Glu Ala Met Glu Ser Leu Leu Lys 645 <210>SEQ ID NO 193 <211> LENGTH: 1544 <212> TYPE: DNA <213> ORGANISM: Glycinemax <400> SEQUENCE: 193 ggcaccagac cacttgtaac tcttgttgct tcagccccagaagtccaata tgttgctttg 60 aggaatatcg accttctctt gcaagcgaag ccagacattctcagcaaaga actgagggtc 120 ttcttctgca aatataatga tcccccatat gttaaacttcagaaattgga gattatggta 180 cggatagcaa atgacaagaa tgttgatcaa ttgttatctgagctcaaaga atacgcactt 240 gaagtcgata tggacttcgt tcgaagagct gtcaaggctattggtcaagc agctataaaa 300 attgagagcg caagcgaaaa gtgtgtcaac acattactcgatttgattgc tacaaaggtc 360 aactacgtgg ttcaagaggc tattgtggtt attaaagatattttcagaaa gtatccaggt 420 tacgaaggaa tcatcccgac cttgtgcaag tacattgacgaattagatga gccaaatgca 480 agaggagctt taatttggat tgtcggcgag tatgctgagaagatcagcaa cgcagatgaa 540 attcttgcag gatttgttga agggttcatg gaagaatttacccaaactca acttcaaatc 600 ctgacagctg tcgtcaagtt atttttgaag aaaccggacaacaatcaagg tcttgttcaa 660 aaggtcttgc aagtatcaac ggccgaaaat gataaccctgatataagaga tagagcttat 720 gtatattggc gtttattatc aggtgatctt tcaatagcaaaggacatcat cctttccgaa 780 aagccaccca tcactacaac catgacatct cttccaccagctctcctcga acaactttta 840 ggtgaactca gcacgcttgc ttctgtctac cataaaccccctgagacatt cgttggccaa 900 ggtcgttacg gcgctgacgc cattcaacat gccgctatccaggaacaacg tcaaaatgct 960 gtcgaaaacc caattgcggc ggccgttgcc gcagccgcaaatggtactac tgctcaaaat 1020 aatgctgaga atctcctcga tattgatttt gatggtgcggctccagcaag tgcagatgcg 1080 cctccaactg cgggcacaag tggattggaa ggattagcaggcacgcctca aagagtagat 1140 agtccagctg cgggcgcaag tgcaggtggt aatatggcggatatgatggg tttatttgat 1200 gctcctattc ctactgcagg cggaatgggt ggcatgggtgggatgggaaa tgatatgatg 1260 aatggattcg cagggttgga tttgagtggt tctagtcaaccacctggtgc acagacacag 1320 ttacagcagg gtggtgggaa aaaaacgaca gaggatcttttgggaatgtt ttaggcttga 1380 agatgatgtt aagtcggtgg ggaaaagagg acgccggagcgggaagggtt tgtggtagtc 1440 aaggaaagag gatagcgaaa atgaaatgtt tgataagacagtcagtggtt agaagttagt 1500 aaaatcattg attgattgat tgattaaaaa aaaaaaaaaaaaaa 1544 <210> SEQ ID NO 194 <211> LENGTH: 457 <212> TYPE: PRT <213>ORGANISM: Glycine max <400> SEQUENCE: 194 Gly Thr Arg Pro Leu Val ThrLeu Val Ala Ser Ala Pro Glu Val Gln 1 5 10 15 Tyr Val Ala Leu Arg AsnIle Asp Leu Leu Leu Gln Ala Lys Pro Asp 20 25 30 Ile Leu Ser Lys Glu LeuArg Val Phe Phe Cys Lys Tyr Asn Asp Pro 35 40 45 Pro Tyr Val Lys Leu GlnLys Leu Glu Ile Met Val Arg Ile Ala Asn 50 55 60 Asp Lys Asn Val Asp GlnLeu Leu Ser Glu Leu Lys Glu Tyr Ala Leu 65 70 75 80 Glu Val Asp Met AspPhe Val Arg Arg Ala Val Lys Ala Ile Gly Gln 85 90 95 Ala Ala Ile Lys IleGlu Ser Ala Ser Glu Lys Cys Val Asn Thr Leu 100 105 110 Leu Asp Leu IleAla Thr Lys Val Asn Tyr Val Val Gln Glu Ala Ile 115 120 125 Val Val IleLys Asp Ile Phe Arg Lys Tyr Pro Gly Tyr Glu Gly Ile 130 135 140 Ile ProThr Leu Cys Lys Tyr Ile Asp Glu Leu Asp Glu Pro Asn Ala 145 150 155 160Arg Gly Ala Leu Ile Trp Ile Val Gly Glu Tyr Ala Glu Lys Ile Ser 165 170175 Asn Ala Asp Glu Ile Leu Ala Gly Phe Val Glu Gly Phe Met Glu Glu 180185 190 Phe Thr Gln Thr Gln Leu Gln Ile Leu Thr Ala Val Val Lys Leu Phe195 200 205 Leu Lys Lys Pro Asp Asn Asn Gln Gly Leu Val Gln Lys Val LeuGln 210 215 220 Val Ser Thr Ala Glu Asn Asp Asn Pro Asp Ile Arg Asp ArgAla Tyr 225 230 235 240 Val Tyr Trp Arg Leu Leu Ser Gly Asp Leu Ser IleAla Lys Asp Ile 245 250 255 Ile Leu Ser Glu Lys Pro Pro Ile Thr Thr ThrMet Thr Ser Leu Pro 260 265 270 Pro Ala Leu Leu Glu Gln Leu Leu Gly GluLeu Ser Thr Leu Ala Ser 275 280 285 Val Tyr His Lys Pro Pro Glu Thr PheVal Gly Gln Gly Arg Tyr Gly 290 295 300 Ala Asp Ala Ile Gln His Ala AlaIle Gln Glu Gln Arg Gln Asn Ala 305 310 315 320 Val Glu Asn Pro Ile AlaAla Ala Val Ala Ala Ala Ala Asn Gly Thr 325 330 335 Thr Ala Gln Asn AsnAla Glu Asn Leu Leu Asp Ile Asp Phe Asp Gly 340 345 350 Ala Ala Pro AlaSer Ala Asp Ala Pro Pro Thr Ala Gly Thr Ser Gly 355 360 365 Leu Glu GlyLeu Ala Gly Thr Pro Gln Arg Val Asp Ser Pro Ala Ala 370 375 380 Gly AlaSer Ala Gly Gly Asn Met Ala Asp Met Met Gly Leu Phe Asp 385 390 395 400Ala Pro Ile Pro Thr Ala Gly Gly Met Gly Gly Met Gly Gly Met Gly 405 410415 Asn Asp Met Met Asn Gly Phe Ala Gly Leu Asp Leu Ser Gly Ser Ser 420425 430 Gln Pro Pro Gly Ala Gln Thr Gln Leu Gln Gln Gly Gly Gly Lys Lys435 440 445 Thr Thr Glu Asp Leu Leu Gly Met Phe 450 455 <210> SEQ ID NO195 <211> LENGTH: 531 <212> TYPE: DNA <213> ORGANISM: Triticum aestivum<220> FEATURE: <221> NAME/KEY: unsure <222> LOCATION: (442) <400>SEQUENCE: 195 cagccaggac gctcctagct atggttttct cccagaatgt ctctcctggagcaccaaact 60 cgttactgca ggttgcagtg aagaataatc agcaacctgt gtggtatttcagcgacaaag 120 gcttgctgca tgtgttcttt ggtgaagatg gcaaaatgga gcggacgagttttcttgagg 180 cctggaaatc tttacctgat gacaatgaat tttcaaaaga ataccccaattctgtcatca 240 gcagcataga tgccaccgtt gaacaccttg cagcatcaaa tgtgttctttattgccaagc 300 ggaaaaatgc aaacatggat gtgctgtatc tgtctgcgaa ggtcccccgtgggattccat 360 ttctgataga gcttactgct gctgttggtg ttcctgggtg ccaagtgtgcagtcaagact 420 ccaaacaagg agttgtgcct cnatctttga agctatggag tctcttacaattaagtgacg 480 ggaaacttga gatcgtatcc gtcaaaatta tccggtatga cataacaagc g531 <210> SEQ ID NO 196 <211> LENGTH: 116 <212> TYPE: PRT <213>ORGANISM: Triticum aestivum <400> SEQUENCE: 196 Pro Asn Ser Leu Leu GlnVal Ala Val Lys Asn Asn Gln Gln Pro Val 1 5 10 15 Trp Tyr Phe Ser AspLys Gly Leu Leu His Val Phe Phe Gly Glu Asp 20 25 30 Gly Lys Met Glu ArgThr Ser Phe Leu Glu Ala Trp Lys Ser Leu Pro 35 40 45 Asp Asp Asn Glu PheSer Lys Glu Tyr Pro Asn Ser Val Ile Ser Ser 50 55 60 Ile Asp Ala Thr ValGlu His Leu Ala Ala Ser Asn Val Phe Phe Ile 65 70 75 80 Ala Lys Arg LysAsn Ala Asn Met Asp Val Leu Tyr Leu Ser Ala Lys 85 90 95 Val Pro Arg GlyIle Pro Phe Leu Ile Glu Leu Thr Ala Ala Val Gly 100 105 110 Val Pro GlyCys 115

What is claimed is:
 1. An isolated nucleic acid comprising a nucleotidesequence selected from the group consisting of: (a) an isolated nucleicacid encoding a polypeptide selected from the group consisting of SEQ IDNOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36,38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72,74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106,108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134,136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162,164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190,192, 194, and 196; and (b) an isolated nucleic acid sequence comprisinga complement of (a).
 2. An isolated polynucleotide comprising anucleotide sequence selected from the group consisting of: (a) a firstnucleotide sequence encoding a polypeptide of at least 80 amino acidsthat has at least 92% identity based on the Clustal method of alignmentwhen compared to a polypeptide selected from the group consisting of SEQID NOs:120, 122, 124, 126, 128, 130, 132, and 134; and (b) a secondnucleotide sequence comprising a complement of the first nucleotidesequence.
 3. The isolated polynucleotide of claim 2, wherein the firstnucleotide sequence comprises of a nucleic acid sequence selected fromthe group consisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131,and
 133. 4. The isolated polynucleotide of claim 2 wherein thenucleotide sequences are DNA.
 5. The isolated polynucleotide of claim 2wherein the nucleotide sequences are RNA.
 6. A chimeric gene comprisingthe isolated polynucleotide of claim 2 operably linked to at least onesuitable regulatory sequence.
 7. A host cell comprising the chimericgene of claim
 6. 8. A host cell comprising the isolated polynucleotideof claim
 2. 9. The host cell of claim 8 wherein the host cell isselected from the group consisting of yeast, bacteria, and plant.
 10. Avirus comprising the isolated polynucleotide of claim
 2. 11. Apolypeptide of at least 80 amino acids that has at least 92% identitybased on the Clustal method of alignment when compared to a polypeptideselected from the group consisting of SEQ ID NOs:120, 122, 124, 126,128, 130, 132, and
 134. 12. A method of selecting an isolatedpolynucleotide that affects the level of expression of a phospholipase Dpolypeptide in a plant cell, the method comprising the steps of: (a)constructing an isolated polynucleotide comprising a nucleotide sequenceof at least one of 30 contiguous nucleotides derived from an isolatedpolynucleotide of claim 2; (b) introducing the isolated polynucleotideinto the plant cell; (c) measuring the level of the polypeptide in theplant cell containing the polynucleotide; and (d) comparing the level ofthe polypeptide in the plant cell containing the isolated polynucleotidewith the level of the polypeptide in a plant cell that does not containthe isolated polynucleotide.
 13. The method of claim 12 wherein theisolated polynucleotide consists of a nucleotide sequence selected fromthe group consisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131,and
 133. 14. A method of selecting an isolated polynucleotide thataffects the level of expression of a phospholipase D polypeptide in aplant cell, the method comprising the steps of: (a) constructing theisolated polynucleotide of claim 2; (b) introducing the isolatedpolynucleotide into the plant cell; (c) measuring the level of thepolypeptide in the plant cell containing the polynucleotide; and (d)comparing the level of the polypeptide in the plant cell containing theisolated polynucleotide with the level of the polypeptide in a plantcell that does not contain the polynucleotide.
 15. A method of obtaininga nucleic acid fragment encoding a phospholipase D polypeptidecomprising the steps of: (a) synthesizing an oligonucleotide primercomprising a nucleotide sequence of at least one of 30 contiguousnucleotides derived from a nucleotide sequence selected from the groupconsisting of SEQ ID NOs:119, 121, 123, 125, 127, 129, 131, and 133 anda complement of such nucleotide sequences; and (b) amplifying a nucleicacid sequence using the oligonucleotide primer.
 16. A method ofobtaining a nucleic acid fragment encoding a phospholipase D polypeptidecomprising the steps of: (a) probing a cDNA or genomic library with anisolated polynucleotide comprising at least one of 30 contiguousnucleotides derived from a nucleotide sequence selected from the groupconsisting of SEQ IDNOs:119, 121, 123, 125, 127, 129, 131, and 133 and acomplement of such nucleotide sequences; (b) identifying a DNA clonethat hybridizes with the isolated polynucleotide; (c) isolating theidentified DNA clone; and (d) sequencing a cDNA or genomic fragment thatcomprises the isolated DNA clone.
 17. A composition comprising theisolated polynucleotide of claim
 2. 18. A composition comprising thepolypeptide of claim
 11. 19. An isolated polynucleotide of claim 2comprising a nucleotide sequence having at least one of 30 contiguousnucleotides.
 20. A method for positive selection of a transformed cellcomprising the steps of: (a) transforming a host cell with the chimericgene of claim 6; and (b) growing the transformed host cell underconditions which allow expression of a polynucleotide in an amountsufficient to complement a null mutant to provide a positive selectionmeans.
 21. The method of claim 20 wherein the host cell is a plant. 22.The method of claim 21 wherein the plant cell is a monocot.
 23. Themethod of claim 21 wherein the plant cell is a dicot.
 24. A method ofaltering the level of expression of a phospholipase D in a host cellcomprising the steps of: (a) transforming a host cell with the chimericgene of claim 6; and (b) growing the transformed host cell produced instep (a) under conditions that are suitable for expression of thechimeric gene wherein expression of the chimeric gene results inproduction of altered levels of a phospholipase D in the transformedhost cell.